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VO (54) Title: METHODS FOR PRODUCING SULPHUROUS FINE CHEMICALS 
00 

(54) Bezeichnung: VERFAHREN ZUR HERSTELLUNG VON SCHWEFELHALTIGEN FEINCHEM1KALIEN 

00 

^ (57) Abstract: The invention relates to methods for producing sulphurous fine chemicals, in particular L-methionine, by fermenta- 
lion, using bacteria, in which a nucleotide sequence that codes for a methionine synthase (metF) gene is expressed. 



Q (57) Zusammenfassung: Die Erfindung betrilR Verfahren zur fermenlativen Herstellung von schwefelhaltigen Feinchemikalien, 
insbesondere L-Methionin, unterVe ~ 
kleotidsequenzen exprimiert wird. 



^ insbesondere L-Methionin, unter Verwendung von Bakterien, in denen eine fur ein Methionin-Synthase (metF)-Gen kodierende Nu- 
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Verfahren zur Herstellung von schwefeihaltlgen Feinchemlkalien| 

Bgschrelbung 

Gegenstand der Erfindung ist ein Verfahren zur fermentativen Herstellung von schwefelhaltigen 
Feinchemikalien, insbesondere L-Methionin, unter Verwendung von Bakterien, in denen eine fDr 
ein Methionin-Synthase (metH)-Gen kodierende Nukleotidsequenz exprimiert wird. 

Stand der Technik 

Schwefelhaltige Feinchemikalien, wie zurn Beispiel Methionin, Homocystein, S-Adenosyi- 
Methionin, Glutathion, Cystein, Biotin, Thiamin, Lipons3ure werden Qber natQrliche Stoffwech- 
selprozesse in Zelten hergestellt und werden in vielen Industriezweigen verwendet, einschlieB- 
lich der Nahrungsmittel-, Futtermittel-, Kosmetik- und pharmazeutischen Industrie. Diese Sub- 
stanzen, die zusammen als "schwefelhaltige Feinchemikalien" bezeichnet werden, umfassen 
organische Sauren, sowohl proteinogene als auch nicht-proteinogene Arninosauren, Vitamine 
und Cofaktoren. Ihre Produktion erfolgt am zweckmaBigsten im GroBmaBstab mittels Anzucht 
von Bakterien, die entwickelt wurden, urn gnoBe Mengen der jeweils gewQnschten Substanzzu 
produzieren und sezemieren. FOr diesen Zweck besonders geeignete Organismen sind coryne- 
forme Bakterien, gram-positive nicht-pathogene Bakterien. 

Es ist bekannt, dass Arninosauren durch Fermentation von Stammen coryneformer Bakterien, 
insbesondere Corynebacterium glutamicum, hergestellt werden. Wegen der groBen Bedeutung 
wird standig an der Verbesserung der Herstellverfahren gearbeitet Verfahrensverbesserungen 
kdnnen fermentationstechnische MaBnahmen, wie zurn Beispiel ROhrung und Versorgung mit 
Sauerstoff, oder die Zusammensetzung der Nahrmedien, wie zurn Beispiel die Zuckerkonzentra- 
tion wahrend der Fermentation, Oder die Aufarbeitung zurn Produkt , beispielsweise durch lone- 
naustauschchromatographie, oder die intrinsischen Leistungseigenschaften des Mikroorganis- 
mus selbst betreffen, 

Ober Stammselektion sind eine Reihe von Mutantenstammen entwickelt worden, die ein Sorti- 
ment wOnschenswerter Verbindungen aus der Reihe der schwefelhaltigen Feinchemikalien pro- 
duzieren. Zur Verbesserung der Leistungseigenschaften dieser Mikroorganismen hinsichtlich der 
Produktion eines bestimmten MolekQIs werden Methoden der Mutagenese, Selektion und Mut- 
antenauswahl angewendet. Dies ist jedoch ein zeitaufwendiges und schwieriges Verfahren. Auf 
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diese Weise erhalt man z.B. StSmme, die resistent gegen Antimetabolite Oder Hemmstoffe, vine 
z. B. die Methionin-Analoga a-Methyl-Methionin, Ethionin, Norleucin, N-Acetylnorleucin, S- 
Trifluoromethylhomocystein, 2-Amino-5-hepreno'rtsaure, Seleno-Methionin, Methioninsulfoximln, 
Methoxin, 1-Aminocyclopentan-Carboxylsaure oder auxotroph fOr regulatorisch bedeutsame 
5 Metabolite sind und schwefelhaltige Feinchemikalien, wie z. B. L-Methionin, produzieren. 

Seit einigen Jahren werden ebenfalls Methoden der rekombinanten DNA-Technik zur Stamm- 
verbessewng von L-AminosSure produzierender Stamme von Corynebacterium eingesetzt, in- 
dem man einzelne Aminosaure-Biosynthesegene amplifiziert und die Auswirkung auf die Amino- 
1 0 sSure-Produktion untersucht. 

Die WO-A-02/10209 beschreibt ein Verfahren zur fermentativen Herstellung von L-Methionin 
unter Verwendung L-Methionin produzierender coryneformer Bakterien, worin wenigstens das 
metH-Gen Oberexprimiert ist, sowie die kodierende metH-Sequenz aus C. glutamicum ATCC 
15 13032. 

Kurze Beschreibung der Erfindung 

Der Erfindung lag die Aufgabe zugrunde, ein neues Verfahren zur verbesserten fermentativen 
20 Herstellung von schwefelhaltige Feinchemikalien, insbesondere L-Methionin, bereitzustellen. 

GelSst wird obige Aufgabe durch Bereitstellung eines Verfahrens zur fermentativen Herstellung 
einerschwefelhalfigen Feinchemikalie, umfassend die Expression einer heterologen Nukleotid- 
sequenz, welche fQr ein Protein mit metH-Aktivitat kodiert, in einem coryneformen Bakterium. 

25 

Ein erster Gegenstand der Erfindung ist ein Verfahren zur fermentativen Herstellung wenigstens 
einerschwefelhalfigen Feinchemikalie, welches folgende Schritte umfasst 

a) Fermentation einer die gewunschte schwefelhaltige Feinchemikalie produzierenden co- 
ryneformen Bakterienkultur, wobei in den coryneformen Bakterien zumindest eine hetero- 

30 loge Nukleotidsequenz exprimiert wird, welche fQr ein Protein mit Methionin-Synthase 

(metH)-Aktivitat kodiert; 

b) Anreicherung der schwefelhaltigen Feinchemikalie im Medium oder in den Zellen der 
Bakterien, und 
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c) Isolieren der schwefelhaltigen Feinchemikalie, welche vorzugsweise L-Methionin uro- 
fasst 



Vorzugsweise besitzt obige heterologe metH-kodierende Nukleotidsequenz zur metH- 
5 kodierenden Sequenz aus Corynebacterium glutamicum ATCC 1 3032 eine Sequenzhomologie 
vom weniger als 70% aufweist Die metH-kodierende Sequenz ist vorzugsweise aus einem der 
folgenden Organismen von Liste I abgeleitet 



Liste I 

10 



Streptomyces coelicolor 


ATCC 10147 


Anabaena sp. 


ATCC 27892 


Synechocystis sp. 


ATCC 27184 


Prochlorococcus marinus 


PCC7118 


Thermus thermophilus 


ATCC 27634 


Bacillus halodurans 


ATCC 21591 


Bacillus stearothermophilus 


ATCC 12980 


Vibrio cholerae 


ATCC 39315 


Sinortiizobium meliloti 


ATCC 4399 


Escherichia coli K12 


ATCC55151 I 


Salmonella typhimurium 


ATCC 15277 


Salmonella typhi 


ATCC 12839 


Pseudomonas fluorescens 


ATCC 13525 


Pseudomonas aeruginosa 


ATCC 17933 


Nitrosomonas europeae 


ATCC 19718 


Bordeteila pertussis 


ATCC 9797 


Clorobium tepidum 


ATCC 49652 


Deinococcus radiodurans 


ATCC 13939 I 


Clostridium acetobutylicum 


ATCC 824 


Caulobacter crescentus 


ATCC 19089 


Homo sapiens 




Vibrio fischeri 


ATCC 33715 


Agrobacterium tumefaciens str. C58 (Cereon) 


ATCC 33970 


Ralstonia solanacearum 


ATCC 25237 



ATCC: American Type Culture Collection, Rockville, MO, USA 
PCC: Pasteur Culture Collection of Cyanobacteria. Paris Frankreich 

15 Die erfindungsgemail eingesetzte metH-kodierende Sequenz umfasst vorzugsweise eine kodie- 
rende Sequenz gemafi SEQ ID NO: 1 , 3, 5, 7, 9, 1 1 , 1 3, 1 5, 1 7, 1 9. 21 , 23, 25, 27, 29, 31 , 33, 35, 
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37, 39, 41, 43, 45, 47, 49 und 51 oder eine dazu homologe Nukleotidsequenz, welche fQr ein 
Protein mit metH-Aktivitat kodiert 

Die erfindungsgemad elngesetzte metH-kodierende Sequenz kodiert auBerdem vorzugsweise 
5 fQr ein Protein mit metH-Aktivitat, wobei das Protein eine Aminosauresequenz gem§B SEQ ID 
NO:2, 4, 6, 8, 10, 12, 14, 16, 1 8, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40. 42. 44, 46, 48, 50 und 
52 oder eine dazu homologe Aminosauresequenz, welche for ein Protein mit metH-Aktivitat 
steht, umfasst 

10 Die kodierende metH-Sequenz ist vorzugsweise eine in coryneformen Bakterien replizierbare 
oder eine stabil in das Chromosom intregrierte DNA oder eine RNA. 

Gemaii einer bevorzugten AusfQhrungsform wird das erfindungsgemaBe Verfahren durchge- 
fOhrt, indem man 

15 

a) einen miteinem Plasmidvektortransformierten Bakterienstammeinsetztderwenigstens 
eine Kopie der kodierenden metH-Sequenz unter der Kontrolle regulativer Sequenzen trSgt oder 

b) einen Stamm einsetzt, in dem die kodierende metH-Sequenz in das Chromosom des 
Bakteriums integriert wurde. 

20 

Es ist weiterhin bevorzugt, die kodierende metH-Sequenz fOr die Fermentation zu Oberexprimie- 
ren. 

AuBerdem kann es wOnschenswert sein, Bakterien zu fermentieren, in denen zusatzlich wenigs- 
25 tens ein weiteres Gen des Biosyntheseweges der gewOnschten schwefelhaltigen Feinchemikalie 
oder eines damit assoziierten Biosynthese- oder sonstigen Stoflwechselweges, verstarict ist; und 
/oder 

in denen wenigstens ein Stoffwechselweg zumindest teilweise ausgeschaltet sind, der die BN- 
dung der gewOnschten schwefelhaltigen Feinchemikalie vemngert 

30 

AuBerdem kann es wOnschenswert sein, Bakterien zu fermentieren, in denen zusatzlich wenigs- 
tens ein weiteres Gen des Biosyntheseweges der gewOnschten schwefelhaltigen Feinchemikalie 
durch Stoffwechselmetabolite in seiner Aktivitat nicht in unerwOnschter Weise beeinflusst wird. 
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Gemaii einer weiteren AusfQhrungsform des erfindungsgemSIXen Verfahrens werden deshalb 
coryneforme Bakterien fermentiert, in denen gleichzeitig wenigstens eines der Gene, ausgewShlt 
unter 

a) dem fQr eine Aspartatklnase kodierenden Gen lysC, 

b) dem fQr eine Aspartat-Semialdehyd-Dehydrogenase kodierenden Gen asd 

c) dem fQr die Glycerinaldehyd-3-Phosphat Dehydrogenase kodierenden Gen gap, 

d) dem fQr die 3-Phosphoglycerat Kinase kodierenden Gen pgk, 

e) dem for die Pyruvat Carboxylase kodierenden Gen pyc t 

f) dem far die Triosephosphat Isomerase kodierenden Gen tpi, 

g) dem fQr die Homoserin O-Acetyttransferase kodierenden Gen metA, 

h) dem fQr die Cystathionin-gamma-Synthase kodierenden Gen metB, 

i) dem fQr die Cystathionin-gamma-Lyase kodierenden Gen metC, 
j) dem fQr die Serin-Hydroxymethyltransferase kodierenden Gen glyA, 
k) dem fQr die OAcetylhomoserin-Stilfhydryiase kodierenden Gen metY, 
I) dem fQr die Methyten-Tetrahydrofolat-Reduktase kodierenden Gen, metF 
m) dem fQr die Phosphoserin-Aminotransferase kodierenden Gen serC 
n) dem fQr die Phosphoserin-Phosphatase kodierenden Gen serB, 
o) dem fQr die Serine Acetyl-Transferase kodierenden Gen cysE, 
p) dem fQr die Homoserin-Dehydrogenase kodierenden Gen horn, 

Qberexprimiert ist 

GemSIJ einer anderen AusfQhrungsform des erfindungsgemSBen Verfahrens werden corynefor- 
25 me Bakterien fermentiert, in denen gleichzeitig wenigstens eines der Gene ausgewShlt unter 
Genen der oben genannten Gruppe a) bis p) mutiert ist, insbesondere so dass die korrespondie- 
renden Proteine, verglichen mit nicht mutierten Proteinen, in geringerem MaBeodemichtdurch 
Stoffwechselmetabolite in ihrer Aktivitat beeinflusst werden und dass insbesondere die erfin- 
dungsgemafle Produktion der Feinchemikalie nicht beeintrachtigt wind. Durch die Mutafion kann 
30 das Protein auch eine hfihere Aktivitat (Sunstratumsatz) und/oder Sunstratspezifitat besitzen und 
damit die Produktion der gewunschten Feinchemikalie fflrdem. 
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6em§B eineranderen AusfQhrungsform des erfindungsgema&en Verfahrens werden corynefbr- 
me Bakterien fermentiert, in denen gleichzeitig wenigstens eines der Gene, ausgewShlt unter 
q) dem fOr die Homoserine-Kinase kodierenden Gen thrB, 
r) dem fOr die Threonin Dehydratase kodierenden Gen ilvA, 
s) dem fQr die Threonin Synthase kodierenden Gen thrC 
t) dem fQr die Meso-Diaminopimelat D-Dehydrogenase kodierenden Gen ddh 
u) dem fQr die Phosphoenolpyruvat-Carboxykinase kodierenden Gen pck. 
v) dem fOr die Glucose-6-Phosphat-6-lsomerase kodierenden Gen pgi, 
w) dem fOr die Pyruvat-Oxidase kodierenden Gen poxB, 
x) dem fQr die Dihydrodipicolinat Synthase kodiemden Gen dapA, 
y) dem fQr die Dihydrodipicolinat Reduktase kodiemden Gen dapB; oder 
z) dem fQr die Dtaminopicolinat Decarboxylase kodiemden Gen lysA 
abschwdcht ist, insbesondere durch Verringerung der Expressionsrate des konrespondierenden 
Gens, oder durch Expression eines Proteins mit geringerer Aktivitat (Substratumsatz). 

Gemaii einer anderen AusfQhrungsform des erfindungsgemaBen Verfahrens werden corynefbr- 
me Bakterien fermentiert, in denen gleichzeitig wenigstens eines der Gene derobigen Gruppen 
q) bis z) mutiert ist, so dass die enzymatische Aktivitat des konrespondierenden Proteins teilwei- 
se oder vollstdndig verringert wind. 

Vorzugsweise werden in dem erfindungsgemaBen Verfahren Mikroorganismen der Art Coryne- 
bacterium glutamicum eingesetzt 

In einer weiteren AusfQhrungsform des Verfahrens werden solche Mikroorganismen eingesetzt, 
die ResistenzgegenOber wenigstens einen Methionin-Biosynthesehemmer aufweisen. Solche 
Hemmer sind, ohne darauf beschrSnkt zu sein, Methionin-Analoga, wie a-Methyl-Methionin, 
Ethionin, Norieucin, N-Acetylnorleucin, S-Trifluoromethylhomocystein, 2-Amino-5- 
heprenoitsaure, Seleno-Methionin, Methioninsulfoximin, Methoxin, und 1-Aminocyclopentan- 
Carboxylsaure. 

Ein weiterer Gegenstand der Erfindung betriffl eln Verfahren zur Herstellung eines L-Methionin- 
haltigen Tierfuttermittel-Additivs aus FermentationsbrQhen, welches folgende Schritte umfasst 
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a) Kultivierung und Fermentation eines L-Methionin produzierenden Mikroorganis- 
mus in einem Fermentationsmedium; 

b) Entfemung von Wasser aus der L-Methionin haltigen FermentationsbrOhe; 

c) Entfemung der wahrend der Fermentation gebildeten Biomasse in einer Menge 
von 0 bis 1 00 Gew.-%; und 

d) Trocknung der gemSfi b) und/oder c) erhaltenen FermentationsbrOhe, urn das 
Tierf uttermittel-Additiv in der gewOnschten Pulver- Oder Granulatfonm zu erhalten. 

Gegenstand der Erfindung sind ebenfalls die erstmalig aus obigen Mikroorganismen isolierten 
kodierenden metH-Sequenzen, die davon kodierten Methionin-Synthasen sowie die funktionalen 
Homologen dieser Polynukleotide bzw. Proteine. 

Gegenstand der Erfindung sind insbesondere auch die zur DurchfOhrung oblger Verfahren not- 
wendigen Expressionskonstrukte und Mikroorganismen. 

Weitere GegenstSnde der Erfindung sind somit insbesondere: 

- das Plasmid pCIS lysC thr31 1 ile, kodierend fOr lysC thr31 1 ile Oder ein funktionales Aquivalent 
davon, d.h. eine lysC-Mutante mit vergleichbarer, gegenOber dem Wildtyp erhflhter Aspartatki- 
nase-Aktivitat; 

- ein Wirtsorganismus transformiert mit dem Plasmid pCIS lysC thr31 1 ile, insbesondere ausge- 
wahlt unter Mikroorganismen der Gattung Corynebacterium, insbesondere der Art C. glutaml- 
cum, wie der transformierte Stamm LU1479 lysC 31 1 ile; 

- das Plasmid pC Phsdh metH_Sc, kodierend fOr metH aus Streptomyces coelicolor, 

- ein Wirtsorganismus gemSB obiger Definition, transformiert mit einem Plasmid, kodierend fOr 
exgonenes metH; insbesondere transformiert mit dem Plasmid pC Phsdh metH_Sc; 

- ein Wirtsorganismus gemaii obiger Definiiton mit Resistenz gegen wenigstens einen Methio- 
nin-Biosynthesehemmstoff, wie der transformierte Stamm LU1479 lysC 311ile ET-16, gege- 
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benenfalls transformiertmiteinerexogenen kodierenden metH Sequent wiedertransformierte 
Stamm LU1479 lysC 31 lile ET-16 pC Phsdh metH_Sc. 

Detaillierte Beschreibung der Erfinduno , 

a) Allgemeine Begriffe 

Als Proteine mit der biologischen AktrvitSt der Methionine-Synthase, kurz auch als metH genannt 
(systematische Bezeichnung: 5-Methyltetrahydrofolat-Homocystein S-Methyltransferase ;EC 
2.1 .1 .1 3), werden solche Proteine bezeichnet, die in der Lage sind Homocystein unter Verwen- 
dung der Cofaktoren 5-Methyttetrahydrofolat (MTHF), Cobalamin (Vitamin B12) und S-Adenosyf- 
Methionin zu Methionin und Tetrahydrofolat umzusetzen. wahrend der Cofaktor 5- 
Methyltetrahydrofolat stdchiometrisch in die Reaktion mit eingeht (1mol MTHF/1Mol Methionin 
gebildet) wird, wie in der Literatur beschrieben,S-Adenosy1-Methionin substBchiometrisch urn- 
gesetzt Cobalamin hingegen 1st katalytisch an der Umsetzung beteiligt. Dem Fachmann sind 
weitere Details des metH-Proteins bekannt. (Baneijee R.V., Matthews R.G. FASEB J. 4:1450- 
1459, 1990, Ludwig ML Matthews RG. Annual Review of Biochemistry. 66:269-313, 1997, 
Drennan CL. Matthews RG. Ludwig ML. Current Opinion in Structural Biology. 4:919-29, 1994). 
Der Fachmann unterscheldet die Aktivitat der Cobalamin-abhangigen 5-Methyltetrahydrofolat- 
Homocystein S-Methyltransferase von der der Cobalamin-unabhSngigen 5-Methyltetrahydro- 
Reroyltriglutamat-Homocystein S-Methyltransferase (EC 2.1.1.14) auch metE genannt Der 
Fachmann kann die enzymatische Aktivitat von metH durch Enzymtests nachweisen, Vorschrif- 
ten dafOr k6nnen sein: Jarrett JT. Goulding CW. Fluhr K. Huang S. Matthews RG. Methods in 
Enzymology. 281:196-213, 1997. 

Im Rahmen der vortiegenden Erfindung umfasst der Begriff „schwefelhaltige Feinchemikalie" 
jegliche chemische Verbindung, die wenigstens ein Schwefelatom kovalent gebunden enthait 
und durch ein erfindungsgemafJes Fermentationsverfahrens zuganglich ist Nichtlimitierende 
Beispiele dafur sind Methionin, Homocystein, S-Adenosyt-Methionin, insbesondere Methio- 
nin.und S-AdenosyJ-Methionin. 
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Im Rahmen dervorliegenden Erfindung umfassen die Begriffe L-Methionin, Methionin, Homo- 
cystein und S-Adenosylmethionin auch die korrespondierenden Salze, wie z. B. Methionln- 
Hydrochlorid oder Methionin-Sulfat 

5 "Polynukleotide" bezeichnet im allgemeinen Polyribonukleotide (RNA) und Polydeoxyribonukleo- 
tide (DNA), wobei es sich urn nicht mbdifizierte RNA oder DNA oder modifizierte RNA oder DNA 
handeln kann. 

Unter "Polypeptide^ versteht man erfindungsgemaB Peptide oder Proteine, die zwei oder mehr 
1 0 Ober Peptidbindungen verbundene Aminosauren enthalten. 

Der Begriff ..Stoffwechselmetabolir bezeichnet chemische Verbindungen, die im Stoffwechsel 
"von Organismen als Zwischen- oder auch Endprodukte vorkommen und die neben ihrer Eigen- 
schatt als chemische Bausteine auch modulierende Wirkung auf Enzyme und ihre katalytische 

1 5 Aktivitat haben kGnnen. Dabei ist aus der Literatur bekannt, dass solche Stoffwechselmetabolite 
sowohl hemmend als auch stimulierend auf die Aktvitat von Enzymen wiricen k6nnen (Bioche- 
mistry, Stryer, Lubert, 1995 W. H. Freeman & Company, New York, New York.). In der Literatur 
ist auch beschrieben, dass es mdglich ist durch Malinahmen wie Mutation der genomischen 
DNA durch UV-Strahlung, ionisierender Strahlung oder mutagene Substanzen und nachfolgen- 

20 der Selektion auf bestimmte PhSnotypen in Organismen solche Enzyme zu produzieren, In de- 
nen die Beeinflussung durch Stoflwechselmetabolite ver§ndert wurde (Sahm H. Eggeling L de 
Graaf AA. Biological Chemistry 381(9-10):899-910, 2000; Eikmanns BJ. Eggeling L Sahm H. 
Antonie van Leeuwenhoek. 64:145-63, 1 993-94). Diese verSnderten Eigenschaften Wnnen auch 
durch gezielte Malinahmen erreicht werden, Dabei ist dem Fachmann bekannt, in Genen fOr 

25 Enzyme auch gezielt bestimmte Nukleotide der fOr das Protein kodierenden DNA so zu verfln- 
dem, dass das aus der exprimierten DNA-Sequenz resultierende Protein bestimmte neue Ei- 
genschaften aufweist, so zum Beispiel, dass die modulierende Wirkung von Stoffwechselmeta- 
boliten gegenOber dem nicht veranderten Protein verandert ist 

30 Enzyme kdnnen derart in ihrer Aktivitat beeinfludt werden, dass es zu einer Vemngerung der 
Reaktionsgeschwindigkeit, oder zu einer VerSnderung der Affinitat gegenuber dem Substrat 
oder zu einer Anderung der Reaktionsgeschwindigkeiten kommt 



WO 03/087386 



PCT/EP03/04010 



10 

Die Begriffe "exprimieren" bzw. "Verstarkung" oder .Oberexpression" beschreiben im Kontext der 
Erfindung die Produktion bzw. ErhChung der intrazeliulfiren Aktivitat eines oder mehrerer Enzy- 
me in einem Mikroorganismus, die durch die entsprechende DNA kodiert wenden. Dazu kann 
man beispieisweise ein Gen in einen Organismus einbringen, e(n vortiandenes Gen durch ein 
anderes Gen ersetzen, die Kopienzahl des Gens bzw. der Gene ertidhen, einen starken Promo- 
ter verwenden Oder ein Gen verwenden, das fOr ein entsprechendes Enzym mit einer hohen 
Aktivitat kodiert und man kann gegebenenfalls diese MaBnahmen kombinieren. 

b) ErfindungsgemaUe metH-Proteine 

Erfindungsgemau mit umfasst sind ebenfalls .funktionale Aqufvalente" der konkret offenbarten 
metH-Enzyme aus Organismen obiger Liste I. 

.Funktionale Aquivalente" oder Analoga der konkret offenbarten Polypeptide sind im Rahmen 
der vorliegenden Erfindung davon verschiedene Polypeptide, welche weiterhin die gewflnschte 
biologische Aktivitat, wie z.B. Substratspezifitat, besitzen. 

Unter "funktionalen Aquivalenten" versteht man erfindungsgemaB insbesondere Mutanten, wel- 
che in wenigstens einer der oben genannten Sequenzpositionen eine andere als die konkret 
genannte Aminosaure aufweisen aber trotzdem eine der oben genannten biologischen Aktivita- 
ten besitzen. "Funktionale Aquivalente" umfassen somitdie durch eine oder mehrere Aminosau- 
re-Additionen, -Substitutionen, -Deletionen und/oder-lnversionen erhaitlichen Mutanten, wobei 
die genannten Ver3nderungen in Jeglicher Sequenzposition auftreten kSnnen, solange sie zu 
einer Mutante mit dem erfindungsgemaiXen Eigenschaftsprofil fQhren. Funktionale Aquivalenz ist 
insbesondere auch dann gegeben, wenn die Reaktivitatsmuster zwischen Mutante und unver- 
andertem Polypeptid qualitativ Obereinstimmen, d.h. beispieisweise gleiche Substrate mit unter- 
schiedlicher Geschwindigkeit umgesetzt werden. 

"Funktionale Aquivalente" umfassen natOrlich auch Polypeptide welche aus anderen Organis- 
men zuganglich sind, sowie natOrlich vorkommende Varianten. Beispieisweise lassen sich durch 
Sequenzvergleich Bereiche homologer Sequenzregionen festlegen und in Anlehnung an die 
konkreten Vorgaben der Erfindung aquivalente Enzyme ermitteln. 
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.Funktionale Aquivalente* umfassen ebenfalls Fragmente, vorzugsweise einzelne DomSnen 
oder Sequenzmotive, der erfindungsgemSBen Polypeptide, welche z.B. die gewOnschte biologi- 
sche Funktion aufweisen. 

5 .Funktionale Aquivalente" sind aulierdem Fusionsproteine, welche ein der oben genannten Po- 
lypeptidsequenzen oder davon abgeleitete funktionale Aquivalente und wenigstens eine weitere, 
davon funktionell verschiedene, heterologe Sequenz in funktioneller N- oder Otenminaler Ver- 
knQpfung (d.h. ohne gegenseitigen wesentliche funktionelle Beeintrachtigung der Fusionsprote- 
inteile) aufweisen. Nichtlimitiemde Beispiele fOr derartige heterologe Sequenzen sind z.B. Sig- 
1 0 nalpeptide. Enzyme. Immunoglobuline, Oberfiachenantigene, Rezeptoren oder Rezeptorligan- 
den. 

Erfindungsgemaa mit umfasste .funktionale Aquivalente' sind Homologe zu den konkret offen- 
barten Proteinen. Diese besitzen, beispielsweise Ober die gesamte Lflnge, wenigstens 30%, 
1 5 oder etwa 40%, 50 %, vorzugsweise wenigstens etwa 60 %, 65%, 70%, oder 75% ins besondere 
wenigsten 85 %, wie z.B. 90%, 95% oder 99%, Homologiezu einer der konkret offenbarten Se- 
quenzen, berechnet nach dem Algorithmus von Pearson und Lipman, Proc. Natl. Acad, Sci. 
(USA) 85(8), 1 988, 2444-2448. Der Homologiegrad spiegelt insbesondere den Grad der Identitat 
zwischen verSnderter und nicht verSnderter Sequenz wider. 

20 

Homologe der erfindungsgemaiJen Proteine oder Polypeptide kdnnen durch Mutagenese er- 
zeugt werden, z.B. durch Punktmutation oder VerkOrzung des Proteins. Der Begriff 'Homolog*. 
wie er hier verwendet wird, betrifft auch eine variante Form des Proteins, die als Agonist oder 
Antagonist der Protein-Aktivitat wirict. 

25 

Homologe des erfindungsgemaiien Proteine k6nnen durch Screening kombinatorischer Banken 
von Mutanten, wie z.B. VerkOrzungsmutanten, identifiziert werden. Beispielsweise kann eine 
variegierte Bank von Protein-Varianten durch kombinatorische Mutagenese auf Nukleinsaure- 
ebene erzeugt werden, wie z.B. durch enzymatisches Ligieren eines Gemisches synthetischer 
30 Oligonukleotide. Es gibt eine Vielzahl von Verfahren, die zur Herstellung von Banken potentieller 
Homologer aus einer degenerierten Oligonukleotidsequenz verwendet werden kSnnen. Die 
chemische Synthese einer degenerierten Gensequenz kann in einem DNA-Syntheseautomaten 
durchgefQhrt werden, und das synthetische Gen kann dann in einen geeigneten Expressions- 
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vektor ligiert werden. Die Verwendung eines degenerierten Gensatzes ermdglicht die Bereitstel- 
lung sSmtlicher Sequenzen in einem Gemisch, die den gewOnschten Satz an potentiellen Prote- 
insequenzen codieren. Verfahren zur Synthese degenerierter Oligonukleotide sind dem Fach- 
mann bekannt (Z.B. Narang, SA (1983) Tetrahedron 39:3; Itakura etal. (1984) Annu. Rev. Bio- 
5 chem. 53:323; Itakura et al., (1984) Science 198:1056; Ike et at. (1983) Nucleic Acids Res. 
11:477). 

Zusatzlich kflnnen Banken von Fragmenten des Protein-Codons verwendet werden, urn eine 
variegierte Population von Protein-Fragmenten zum Screening und zur anschlieBenden Selektt- 

1 0 on von Homologen eines erfindungsgemSBen Proteins zu erzeugen. Bei einer AusfOhmngsform 
kann eine Bank von kodierenden Sequenzfragmenten durch Behandeln eines doppelsWngigen 
PCR-Fragmentes einer kodierenden Sequenz mit einer Nuklease unter Bedingungen, unter de- 
neri ein Nicking nur etwa einmal pro MolekOI erfolgt, Denaturieren der doppelstrflngigen DNA, 
Renaturieren der DNA unter Bildung doppelstrSngiger DNA, die Sense-7Antisense-Paare von 

1 5 verschiedenen genickten Produkten umfassen kann, Entfemen einzelstrSngiger Abschnitte aus 
neu gebildeten Duplices durch Behandlung mit S1 -Nuclease und Ligieren der resultierenden 
Fragmentbank in einen Expressionsvektor erzeugt werden. Durch dieses Verfahren kann eine 
Expresslonsbank hergeleitet werden, die N-terminale, C-terminale und interne Fragmente mit 
verschiedenen GrtJBen des erfidungsgemfiBen Proteins kodiert 

20 

Im Stand der Technik sind mehrere Techniken zum Screening von Genprodukten kombinatorl- 
scher Banken, die durch Punktmutationen Oder VerkQrzung hergestellt worden sind, und zum 
Screening von cDNA-Banken auf Genprodukte mit einer ausgewShlten Eigenschaft bekannt 
Diese Techniken lassen sich an das schnelle Screening der Genbanken anpassen, die durch 

25 kombinatorische Mutagenese erfindungsgemaBer Homologer erzeugt worden sind. Die am hSu- 
figsten venwendeten Techniken zum Screening groBer Genbanken, die einer Analyse mit hohem 
Durchsatz unterliegen, umfassen das Klonieren der Genbank in replizierbare Expressionsvekto- 
ren, Transformieren dergeeigneten Zellen mit der resultierenden Vektorenbank und Exprimieren 
der kombinatorischen Gene unter Bedingungen, unter denen der Nachweis der gewOnschten 

30 Aktivitat die Isolation des Vektors, der das Gen codiert, dessen Produkt nachgewiesen wurde, 
erleichtert. Recursive-Ensemble-Mutagenese (REM), eine Technik, die die HSufigkeit funktionel- 
ler Mutanten in den Banken vergr6Bert, kann in Kombination mit den Screeningtests verwendet 
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werden, urn Homologe zu idenWizieren (Arkin und Yourvan (1992) PNAS 89:781 1-7815; Delgra- 
ve et al. (1 993) Protein Engineering 6(3):327-331 

c) Erfindungsqemafte Polvnukleotide 

Gegenstand der Erfindung sind ebehso NukleinsSuresequenzen (einzel- und doppelstrflngige 
DNA- und RNA-Sequenzen, wie z.B. cDNA und mRNA), kodierend fOr eines der obigen metH- 
Enzyme und deren funktionalen Aquivalenten, welche Z.B. auch unter Verwendung kOnstiicher 
Nukleotidanaloga zugSnglich sind. 

Die Erfindung betrifft sowohl isoiierte NukleinsauremolekOie, welche fQr erfindungsgemaUe Po- 
typeptide bzw. Proteine Oder biologisch aktive Abschnitte davon kodieren, sowie NukleinsSure- 
fragmente, die z.B. zur Verwendung als Hybridisierungssonden Oder Primer zur Identifizierung 
Oder Amplifizierung von erfindungsgemSIJerkodierenden NukleinsSuren verwendet werrfen k6n- 
nen. 

Die erfindungsgemSBen NukleinsduremolekOle kdnnen zudem untranslatierte Sequenzen vom 
3'- und/oder 5-Ende des kodierenden Genbereichs enthalten 

Ein "isoliertes" NukleinsauremolekQI wind von anderen NukleinsSuremolekQIen abgetrennt, die in 
der natQrlichen Quelle der Nukleinsflure zugegen sind und kann Dberdies im wesentlichen frei 
von anderem zellulSren Material oder Kulturmedium sein, wenn es durch rekombinante Technl- 
ken hergestellt wird, oder frei von chemischen Vorstufen oder anderen Chemikalien sein, wenn 
es chemisch synthetisiert wird. 

Die Erfindung umfasst weiterhin die zu den konkret beschriebenen Nukleotidsequenzen kom- 
plementaren NukleinsauremolekOie oder einen Abschnitt davon. 

Die erfindungsgemSB Nukleotidsequenzen ermfiglichen die Erzeugung von Sonden und Pri- 
mem, die zur Identifizierung und/oder Klonierung von homologer Sequenzen in anderen Zellty- 
pen und Organismen verwendbar sind. Solche Sonden bzw. Primer umfassen gew6hnlich einen 
Nukleotidsequenzbereich, der unter stringenten Bedingungen an mlndestens etwa 12 f vorzugs- 
weise mindestens etwa 25, wie z.B. etwa 40, 50 oder 75 aufeinanderfolgende Nukleotide eines 
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Sense-Stranges einer erfindungsgemaiien NukleinsSuresequenz oder eines entsprechenden 
Antisense-Stranges hybridisiert 

Weitere erfindungsgemSBe NukleinsSuresequenzen sind abgeleitet von SEQ ID NO:1, 3, 5, 7, 9, 
5 11,13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49 Oder 51 und unter- 
scheiden sich davon durch Addition, Substitution, Insertion oder Deletion einzelner oder mehre- 
rer Nukleotide, kodieren aber weiterhin fOr Polypeptide mit dem gewOnschten Eigenschaftsprofil. 
Dies kdnnen Polynukleotide sein, die zu obigen Sequenzen, beispielsweise Ober die gesamte 
Linge, in mindestens etwa 50%, 55%, 60%, 65%, 70%, 80% oder 90%, vorzugsweise in min- 
1 0 destens etwa 95%, 96%, 97%, 98% oder 99% der Sequenzpositionen identisch sind. 

Erfindungsgem§li umfasst sind auch solche NukleinsSuresequenzen, die sogenannte stumme 
Mutationen umfassen oder entsprechend der Codon-Nutzung eins speziellen Ursprungs- oder 
Wirtsorganismus, im Vergleich zu einer konkret genannten Sequenz verSndert sind, ebenso wie 
1 5 natOrlich vorkommende Varianten, wie z.B. Spleilivarianten Oder Allelvarianten, davon. Gegens- 
tand sind ebenso durch konservative Nukleotidsubstutionen (d.h. die betreffende AminosSure 
wird durch eine AminosSure gleicher Ladung, Gr&Be, Poiaritat und/oder LCslichkeit ersetzt) er- 
hSIUiche Sequenzen. 

20 Gegenstand der Erfindung sind auch die durch Sequenzpolymorphismen von den konkret offen- 
barten Nukleinsauren abgeleiteten Molekflle. Diese genetischen Polymorphismen kdnnen zwl- 
schen Individuen innertialb einer Population aufgrund der natQrlichen Variation existieren. Diese 
natOrlichen Variationen bewirken Oblicherweise eine Varianz von 1 bis 5 % in der Nukleotidse- 
quenz eines Gens. 

25 

Weiterhin umfasst die Erfindung auch NukleinsSuresequenzen, welchen mit oben genannten 
kodierenden Sequenzen hybridisieren oder dazu komplementSr sind. Diese Polynukteotide las- 
sen sich bei Durchmusterung von genomischen oder cDNA-Banken auffinden und gegebenen- 
falls daraus mit geeigneten Primem mittels PCR vermehren und anschlieliend beispielsweise 
30 mit geeigneten Sonden isolieren. Eine weitere Mdglichkeit bietet die Transformation geeigneter 
Mikroorganismen mit erfindungsgemaBen Polynukleotiden oderVektoren, die Vermehrung der 
Mikroorganismen und damit der Polynukleotide und deren anschlieBende Isolierung. Dariiber 
hinaus kdnnen erfindungsgem§Be Polynukleotide auch auf chemischem Wege synthetisiert 



WO 03/087386 PCT/EP03/04010 



15 

werden. 

Unter der Eigenschatt, an Polynukleotlde .hybridisieren" zu kannen, versteht man die Fahlgkeit 
eines Poly- oderOligonukleotids unter stringenten Bedingungen an eine nahezu komplementare 
Sequenz zu binden, wShrend unter diesen Bedingungen unspezifische Bindungen zwischen 
nicht-komplementaren Partnem unterbleiben. Dazu sollten die Sequenzen zu 70-100%, vor- 
zugsweise zu 90-100%, komplementflr sein. Die Eigenschatt komplementSrer Sequenzen, 
spezifisch aneinander binden zu kdnnen, macht man sich beispielsweise in der Northern- oder 
Southem-Blot-Technik oder bei der Primerbindung in PCR oder RT-PCR zunutze. Oblicherweise 
werden dazu Oligonukleotlde ab einer LSnge von 30 Basenpaaren eingesetzt Unter stringenten 
Bedingungen versteht man beispielsweise in der Northem-Blot-Technik die Verwendung einer 
50 - 70 °C, vorzugsweise 60 - 65 °C warmen WaschlSsung, beispielsweise 0, 1x SSC-Puffer mit 
0,1% SDS (20x SSC: 3M NaCI, 0,3M Na-Citrat, pH 7,0) zur Elution unspeziflsch hybridisierter 
cDNA-Sonden oder Oligonukleotide. Dabei bleiben, wie oben erwfihnt, nur in hohem Ma&e 
komplementare NuklelnsSuren aneinander gebunden. Die Einstellung stringenter Bedingungen 
ist dem Fachmann bekannt und ist z:B. in Ausubel et al., Current Protocols in Molecular Biology, 
John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. beschrieben. 

c) Isolierunq der kodierenden metH-Gene 

Die fOrdas Enzym Methionln-Synthase(EC 2.1.1 .13) kodierenden metH Gene aus den Organis- 
men obiger Liste I sind in an sich bekannter Weise isolierbar. 

Zur Isolierung der metH-Gene oder auch anderer Gene der Organismen aus obiger Liste I wird 
zunSchst eine Genbank dieses Organsimus in Escherichia coli (E. coli) angelegt Das Anlegen 
von Genbanken ist in allgemein bekannten LehrtOchem und HandbOchem ausfOhrlich beschrie- 
ben. Als Beispiel seien das Lehrbuch von Winnacker Gene und Klone, Eine EinfOhrung in die 
Gentechnologie (Verlag Chemie, Weinheim, Deutschland, 1990), oder das Handbuch von 
Sambrook et al.: Molecular Cloning, A Laboratory Manual (Cold Spring Harbor Laboratory Press, 
1 989) genannt Eine sehr bekannte Genbank ist die des E. coli K-1 2 Stammes W31 1 0, die von 
Kohara et al. (CellSO, 495-508 (198)) in X-Vektoren angelegt wurde. 
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Zur Herstellung einer Genbank von Organismen der Liste I in E. coll kannen Cosmide, wie der 
CosmidvektorSuperCos I (Wahletal., 1987, Proceedings of the National Academy of Sciences 
USA, 84: 2160-2164), aber auch Plasmide, wie pBR322 (BoliVal; Life Sciences, 25, 807-618 
(1 979)) oder pUC9 (Vieira et al., 1 982, Gene, 1 9: 259-268), verwendet werden. Als Wirte eignen 
5 sich besonders solche E. coli StSmme, die restriktions- und rekombinationsdefekt sind. Ein Bel- 
spiel hierfOr ist der Stamm DHSamcr, der von Grant et al. (Proceedings of the National Academy 
of Sciences USA, 87 (1 990) 4645-4649) beschrieben wurde. Die mit Hilfe von Cosmiden klonier- 
ten langen DNA-Fragmente kdnnen anschlieliend wiederum in gflngige, fOr die Sequenzierung 
geeignete Vektoren subkloniert und anschlieBend sequenziert werden, so wie es z. B. bei San- 
1 0 ger et al. (proceedings of the National Academy of Sciences of the United States of America, 74: 
5463-5467, 1 977) beschrieben ist 

Die erhaltenen DNA-Sequenzen kflnnen dann mit bekannten Algorithmen bzw. Sequenzanalyse- 
Programmen, wie z. B. dem von Staden (Nucleic Acids Research 14,217-232(1986)), dem von 
15 Marck (Nucleic Acids Research 16, 1829-1836 (1988)) oder dem GCG-Programm von Butler 
(Methods ofBiochemical Analysis 39, 74-97 (1998)), untersucht weiden. 

Die fur die metH-Gene kodierenden DNA-Sequenzen von Organismen gemaB obiger Liste I 
wurden gefunden. Insbesondere wurden DNA-Sequenzen gem§B gemSB SEQ ID NO:1, 3, 5, 7, 
20 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31. 33, 35, 37, 39, 41, 43, 45, 47, 49 und 51 gefunden. 
Weiterhin wurde aus diesen voriiegenden DNA-Sequenzen mit den oben beschriebenen Metho- 
den die AminosSuresequenzen der entsprechenden Proteine abgeleitet Durch SEQ ID NO:2, 4, 
6, 8, 10, 12, 14, 16. 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50 und 52 sind 
die sich ergebenden AminosSuresequenzen der metH Genprodukte dargestellt. 

25 

Kodierende DNA-Sequenzen, die sich aus den Sequenzen gemaa SEQ ID NO:1 , 3 f 5, 7, 9, 1 1 , 
13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49 und 51 durch die Dege- 
neration des genetischen Kodes ergeben, sind ebenfalls Gegenstand der Erfindung. In gleicher 
Weise sind DNA-Sequenzen, die mit diesen Sequenzen oderdavon abgeleiteten Sequenzteilen 
30 hybridisieren, Gegenstand der Erfindung. 

Anleitungen zur Identifizierung von DNA-Sequenzen mittels Hybridisierung findetder Fachmann 
unter anderem im Handbuch "The DIG System Users Guide fOr Filter Hybridization" der Firma 
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Boehringer Mannheim GmbH (Mannheim, Deutschland. 1993) und bei Liebl et al. (International 
Journal of Systematic Bacteriology (1991 ) 41 : 255-260). Anleitungen zur Amplifikation von DNA- 
Sequenzen mit Hilfe der Polymerase-Kettenreaktion (PCR)findet der Fachmann unter anderem 
im Handbuch von Gait: Oligonukleotide synthesis: A Practical Approach (IRL Press, Ox- ford, 
5 UK, 1984) und bei Newton und Graham: PCR (Spektrum Akademischer Vertag, Heidelberg, 
Deutschland, 1994). 

Weiterhin ist bekannt, dass Anderungen am N- und/oder C- Terminus eines Proteins dessen 
Funktion nicht wesentlich beeintrachtigen oder sogar stabilisieren kdnnen. Angaben hierzu findet 
10 der Fachmann unter anderem bei Ben-Bassat et al. (Journal of Bacteriology 169: 751-757 
(1987)), bei O'Regan et al. (Gene 77: 237-251 (1989), bei Sahin-Toth et al. (Protein Sciences 3: 
240-247 (1994)). bei Hochuli et al. (Biontechnology 6: 1321-1325 (1988)) und in bekannten 
Lehrbuchem der Genetik und Molekularbiologie. 

1 5 Aminosauresequenzen, die sich in entsprechender Weise aus den SEQ ID NO:2, 4, 6, 8, 10, 12, 
14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50 und 52 ergeben, sind 
ebenfalls Bestandteil der Erfindung. 

d) ErfindunqsgemaB verwendete Wirtszellen 

20 

Weitere Gegenstfinde der Erfindung betreffen als Wirtszelle dienende Mikroorgansismen, insbe- 
sondere coryneforme Bakterien, die einen Vektor, insbesondere Pendelvektoroder Plasmidvek- 
tor, der wenigstens ein metH Gen gerfindungsgemaiJer Definition tragt, enthalten Oder in denen 
ein erfindungsgemSBes metH Gen exprimiert bzw. verstarkt ist 

25 

Diese Mikroorganismen k6nnen schwefelhaltige Feinchemikalien, insbesondere L-Methionin t 
aus Glucose, Saccharose, Lactose, Fructose, Maltose, Melasse, Starke, Cellulose Oder aus Gty- 
cerin und Ethanol herstellen. Vorzugsweise sind dies coryneforme Bakterien, insbesondere der 
Gattung Corynebacterium. Aus der Gattung Corynebacterium ist insbesondere die Art Coryne- 
30 bacterium glutamicum zu nennen, die in der Fachwelt fOr ihre Fahigkeit bekannt ist, L- 
Aminosduren zu produzieren. 
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Als Beispiele fQr geeignete StSmme coryneformer Bakterien sind solche der Gattung Corynebac- 
terium, insbesondere der Art Corynebacterium glutamicum (C. glutamicum), wie 
Corynebacterium glutamicum ATCC 13032, 
Corynebacterium acetoglutamicum ATCC 1 5806, 
5 Corynebacterium acetoacidophilum ATCC 13870, 
Corynebacterium thermoaminogenes FERM BP-1539, 
Corynebacterium melassecola ATCC 17965 

Oder 

1 0 der Gattung Brevibacterium, wie 

Brevibacterium flavum ATCC 14067 

Brevibacterium lactofermentum ATCC 13869 und 

Brevibacterium divaricatum ATCC 14020 zu nennen; 

Oder davon abgeleitete Stdmme, wie 
1 5 Corynebacterium glutamicum KFCC1 0065 

Corynebacterium glutamicum ATCC21608 

welche ebenfalls die gewOnschte Feinchemikalie Oder deren Vorstufe(n) produzieren. 
Mitder AbkOrzung KFCC ist die Korean Federation of Culture Collection gemeint, mitder AbkOr- 
20 zung ATCC die American type strain culture collection und mit der AbkOrzung FERM die Samm- 
lung des National institute of Bioscience and Human-Technology, Agency of Industrial Science 
and Technology, Japan. 

e) DurchfOhruno der erfindunqsaemaften Fermentation 

25 

ErfindungsgemSB wurde festgestellt, dass coryneforme Bakterien nach Oberexpression eines 
metH Gens aus Organismen der Liste I in vorteilhafter Weise schwefelhaltige Feinchemikalien, 
insbesondere L-Methionin, produzieren. 

30 Zur Erzielung einer Oberexpression kann der Fachmann unterschiedliche MaBnahmen einzeln 
oder in Kombination ergreifen. So kann die Kopienzahl der entsprechenden Gene erhSht wer- 
den, oder es kann die Promotor- und Regulationsregion oder die Ribosomenbindungsstelle, die 
sich stromaufwarts des Strukturgens befindet, mutiert werden. In gleicher Weise wirken Expres- 
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sionskassetten, die stromaufwarts des Staikturgens eingebaut werden. Durch induzierbare Pro- 
motoren ist es zusatzlich mOglich, die Expression im Veriaufe der fermentativen L-Methionin- 
Produktionzu steigem. Durch MaBnahmen zurVeriangerung der Lebensda uer der mRNA wind 
ebenfalls die Expression verbessert Weiterhin wird durch Verhinderung des Abbaus des En- 
5 zymproteins ebenfalls die Enzymaktivitat verstarict. Die Gene oder Genkonstiukte k6nnen ent- 
weder in Plasmiden mit unterschiedlicher Kopienzahl vorliegen oder im Chromosom integriert 
und amplifiziert sein. Altemativ kann weiterhin eine Oberexpression der betreffenden Gene 
durch VerSnderung der Medienzusammensetzung und KulturfOhrung erreicht werden. 

1 0 Anleitungen hierzu findet der Fachmann unter anderem bei Martin et al. (Biontechnology 5, 1 37- 
146 (1987)), bei Guerrero et al. (Gene 138, 35-41 (1994)), Tsuchiya und Morinaga 
(Bio/Technology 6, 428^30 (1 988)), bei Eikmanns et al. (Gene 1 02, 93-98 (1 991 )), In der Euro- 
paischen Patentschrifl 0472869, im US Patent 4,601 ,893, bei Schwarzerund POhler (Biotechno- 
logy 9, 84-87 (1991), belRemscheid etal. (Applied and Environmental Microbiology 60,1 26-1 32 

1 5 (1994). bei LaBarre et al. (Journal of Bacteriology 175, 1001-1007 (1993)), in der Patentanmel- 
dung WO 96/1 5246, bei Malumbres et al. (Gene 134, 15-24 (1993)), in der japanlschen Offenle- 
gungsschrift J P-A-1 0-229891, bei Jensen und Hammer (Biotechnology and Bioengineering 
58,. 1 91 -1 95 (1 998)), bei Makrides (Microbiological Reviews 60:51 2-538 (1 996) und in bekann- 
ten Lehrbuchern der Genetik und Molekularbiologie. 

20 

Gegenstand der Erfindung sind deshalb auch Expressionskonstrukte, enthaltend unter der ge- 
netischen Kontrolle regulativer Nukleinsauresequenzen eine fOr ein erfindungsgema&es Poly- 
peptid kodierende Nukleinsauresequenz; sowie Vektoren, umfassend wenigstens eines dieser 
Expressionskonstrukte. Vorzugsweise umfassen solche erfindungsgemaBen Konstrukte 5- 

25 stromaufwarts von der jeweiligen kodierenden Sequenz einen Promoter und 3-stromabwarts 
eine Terminatorsequenz sowie gegebenenfalls weitere Obliche regulative Elemente, und zwar 
jeweils operaliv verknOpft mit der kodierenden Sequenz. Unter einer .operativen VerknOpfung* 
versteht man die sequentielle Anordnung von Promotor, kodierender Sequenz, Terminator und 
gegebenenfalls weiterer regulativer Elemente derart, dass jedes der regulativen Elemente seine 

30 Funktion bei der Expression der kodierenden Sequenz bestimmungsgemail erf Qllen kann. Bel- 
spiele fur operativ verknOpfbare Sequenzen sind Aktivrieungssequenzen sowie Enhancer und 
dergleichen. Weitere regulative Elemente umfassen selektierbare Marker, Amplifikationssignale, 
ReplikationsursprOnge und dergleichen. Geeignete regulatorische Sequenzen sind z.B. be- 
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schrieben in Goeddel, Gene Expression Technology: Methods in Enzymology 185, Academic 
Press, San Diego, CA (1 990). 

ZusStzlich zu den artifiziellen Regulationssequenzen kann die natOrliche Regulationssequenz 
5 vor dem eigentlichen Strukturgen noch vorhanden sein. Durch genetische VerSnderung kann 
diese natOrliche Regulation gegebenenfalls ausgeschaltet und die Expression der Gene ertidht 
oder emiedrigt wertJen. Das Genkonstrnkt kann aber auch einfacheraufgebautsein, das hei&t 
es werden keine zusatzlichen Regulationssignale vor das Strukturgen insertiertund der natOrli- 
che Promotor mit seiner Regulation wind nicht entfemt Statt dessen wird die natOrliche Regulati- 
1 0 onssequenz so mutiert, dass keine Regulation mehr erfolgt und die Genexpression gesteigert 
oder verringert wind. Die Nukleins3uresequenzen kdnnen in einer oder mehreren Kopien im 
Genkonstrukt enthalten sein. 

Beispiele fOr brauchbare Promotoren sind: die Promotoren, ddh, amy, lysC, dapA, lysA aus Co- 
1 5 rynebacterium glutamicum, aber auch gram-positiven Promotoren SP02 wie sie in Bacillus Sub- 
tilis and Its Closest Relatives, Sonenshein, Abraham L,Hoch, James A., Losick, Richard; ASM 
Press, District of Columbia, Washington und Patek M. Eikmanns BJ. Patek J. Sahm H. Microbio- 
logy. 142 1297-309, 1 996 beschrieben sind, oder aber auch cos-, tap-, trp-, tet-, trp-tet-, lpp- f lac- 
, Ipp-lac-, laclq-, T7-, T5-, T3-, gal-, trc-, ara- f SP6-, lambda-PR- oder lambda-PL-Promotor, die 
vorteilhafterweise in gram-negativen Bakterien Anwendung finden. Bevorzugt ist auch die Ver- 
wendung induzierbarer Promotoren, wie Z.B. licht- und insbesondere temperaturinduztierbarer 
Promotoren, wie der PrPpPromotor. Prinzipiell kdnnen alle natOriichen Promotoren mit Ihren Re- 
gulationssequenzen verwendet werden. DarOber hinaus kflnnen auch synthetische Promotoren 
vorteilhaft verwendet werden. 

Die genannten regulatorischen Sequenzen sollen die gezielte Expression der Nukleinsfiurese- 
quenzen ermfiglichen. Dies kann beispielsweise je nach Wirtsorganismus bedeuten, dass das 
Gen erst nach Induktion exprimiert oder Oberexprimiert wird, oder dass es sofort exprimiert 
und/oder Oberexprimiert wird. 

Die regulatorischen Sequenzen bzw. Faktoren kdnnen dabei vorzugsweise die Expression posh 
tiv beeinflussen und dadurch erhdhen oder emiedrigen. So kann eine VerstSrkung der regulato- 
rischen Elemente vorteilhafterweise auf der Transkriptionsebene erfolgen, indem starke 
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Transkriptionssignale wie Promotoren und/oder "Enhanced verwendet werden. Daneben ist aber , 
auch eine VerstSrkung der Translation m6glich, indem beispielsweise die Stability der mRNA 
verbessert wind. 

5 Die Herstellung einer Expressionskassette erfolgt durch Fusion eines geeigneten Promotors, 
einer geeigneten Shine-Dalgamo-Sequenz mit einer metH-Nukleotidsequenz sowie einem ge- 
eigneten Terminationssignal. Dazu verwendet man g§ngige Rekombinations- und Klonie- 
rungstechniken, wie sie beispielsweise in Current Protocols in Molecular Biology, 1993, John 
Wiley & Sons, Incorporated, New York New York, PCR Methods, Gelfand, David H., Innis, Mi- 

1 0 chael A., Sninsky, John J. 1 999, Academic Press, Incorporated, California, San Diego, PCR 
Cloning Protocols, Methods in Molecular Biology Ser, Vol. 1 92, 2nd ed., Humana Press, New 
Jersey, Totowa. T. Maniatis, E.F. Fritsch und J. Sambrook, Molecular Cloning: A Laboratory Ma- 
nual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1989) sowie in TJ. Silhavy, M.L 
Berman und L.W. Enquist, Experiments with Gene Fusions, Cold Spring Harbor Laboratory, 

1 5 Cold Spring Harbor, NY (1 984) und in Ausubel, F.M. et al., Current Protocols in Molecular Biolo- 
gy, Greene Publishing Assoc. and Wiley Interscience (1987) beschrieben sind. 

Das rekombinante Nukleinsaurekonstrukt bzw. Genkonstaikt wind zur Expression in einem ge- 
eigneten Wirtsorganismus vorteilhafterweise in einen wirtsspezifischen Vektorinsertiert, der eine 

20 optimale Expression der Gene im Wirt ermdglicht. Vektoren sind dem Fachmann wohl bekannt 
und kflnnen beispielsweise aus "Cloning Vectors" (Pouwels P. H. et al., Hrsg, Elsevier, Amster- 
dam-New York-Oxford, 1985) entnommen werden. Unter Vektoren sind auBer Plasmiden auch 
alle anderen dem Fachmann bekannten Vektoren, wie beispielsweise Phagen, Transposons, IS- 
Elemente, Phasmide, Cosmide, und lineare oder zirkulare DNA zu verstehen. Diese Vektoren 

25 k6nnen autonom im Wirtsorganismus repliziert Oder chromosomal repliziert werden. 

Zur VerstSrkung wurden erfindungsgem§lie metH Gene beispielhaft mit Hilfe von episomalen 
Plasmiden Qberexprimiert. Als Plasmide eignen sich solche, die in coryneformen Bakterien repli- 
ziert werden. Zahlreiche bekannte Plasmidvektoren, wie z. B. pZ1 (Menkel et al., Applied and 
30 Environmental Microbiology (1989) 64: 549-554), pEKExl (Eikmanns et al., Gene 102: 93-98 
(1991)) Oder pHS2-1 (Sonnen et al., Gene 107: 69-74 (1991)) beruhen auf den kryptischen 
Plasmiden pHM1519, pBL1 oder pGA1. Andere Plasmidvektoren, wie z. B. pCUK5MCS, Oder 
solche, die auf pCG4 (US-A 4,489,160) oder pNG2 (Serwold-Davis et al., FEMS Microbiology 
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Letters 66, 119-124 (1990)) Oder pAG1 (US-A 5,158,891) beruhen, k8nnen in gleicher Weise 
verwendet werden. 

Weiterhin eignen sich auch solche Plasmidvektoren mit Hilfe derer man das Verfahren der Ge- 
namplifikation durch Integration in das Chromosom anwenden kann, so wie es belspielsweise 
von Remscheid etal. (Applied and Environmental Microbiology 60,126-132 (1994))zurDuplika- 
tion bzw. Amplifikation des hom-thrB-Operons beschrieben wurde. Bei dieser Methode wird das 
vollstSndige Gen in einen Plasmidvektor kloniert, der in einem Wirt (typischerweise E. coli), nicht 
aber in C. glutamicum replizieren kann. Als Vektoren kommen beispielsweise pSUP301 (Simon 
et al., Bio/ Technology 1,784-791 (1983)), pK18mob oder pK19mob (Schafer et al., Gene 
145,69-73 (1994)), Bernard et al., Journal ofMolecular Biology, 234: 534-541 (1993)), pEM1 
(Schrumpf etal. 1991, Journal of Bacteriology 173:4510-4516)oderpBGS8 (Sprattetal.,1986, 
Gene 41: 337-342) in Frage. Der Plasmidvektor, der das zu amplifizierende Gen enthait, wird 
anschlie&end durch Transformation in den gewOnschten Stamm von C. glutamicum QberfOhrL 
Methoden zur Transformation sind beispielsweise bei Thterbach et al. (Applied Microbiology and 
Biotechnology 29, 356-362 (1988)), Dunican und Shivnan (Biotechnology 7, 1067-1070 (1989)) 
und Tauch et al. (FEMS Microbiological Letters 123,343-347 (1994)) beschrieben. 

Enzyme kdnnen durch Mutationen in den korrespondierenden Genen derart in ihrer Aktivitdt 
beeinfluflt werden, dass es zu einer teilweisen oder vollstSndigen Vemngerung der Reaktions- 
geschwindigkeit der enzymatischen Reaktion kommt Beispiele fOr solche Mutationen sind dem 
Fachmann bekannt (Motoyama H. Yano H. Terasaki Y. Anazawa H. Applied & Environmental 
Microbiology. 67:3064-70, 2001 , Eikmanns BJ. Eggeling L. Sahm H. Antonie van Leeuwenhoek. 
64:145-63, 1993-94.) 

Zusatzlich kann es fQr die Produktion von schwefelhaltige Feinchemikalien, insbesondere L- 
Methionin, vorteilhaft sein, neben einer Expression bzw. VerstSrkung eines erfindungsgemaflen 
metH-Gen eines oder mehrere Enzyme des Methionin-Biosyntheseweges oder eines damit as- 
soziierten (d.h. in einem funktionelle Zusammenhang stehenden) Biosynthese-oder sonstigen 
Stoffwechselweges, wie des Cystein-, Lysin- Oder Threonin-Stoffwechselwegs, wie insbesondere 
der Aspratatsemialdehyd-Synthese, der Glykolyse, der Anaplerotik, des Pentose-Phosphat- 
Stoffwechsels, des 2itronens§ure-Zyklus oder des AminosSure-Exports zu verstarken. 
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So kann fOr die Herstellung von schwefelhaltige Feinchemikalien, insbesondere L-Methionin, 
eines oder mehrere der folgenden Gene verstarkt sein, (d.h. 2.B. in einer hflheren Koplenzahl 
vorliegen oder ein En2ym mit haherer Aktivitat oder Spezifitat kodieren): 

- das fOr eine Aspartatkinase kodierende Gen lysC (EP 1 108 790 A2; DNA-SEQ NO. 281), 

5 -das fOr eine Aspartat-Semialdehyd Dehydrogenasekodierende Gen asd (EP 1 108 790 A2; 
DNA-SEQ NO. 282), 

- das fOr die Glycerinaldehyd-3-Phosphat Dehydrogenase kodierende Gen gap (Eikmanns 
(1992), Journal of Bacteriology 174: 6076-6086), 

- das fOr die 3-Phosphoglycerat Kinase kodierende Gen pgk (Eikmanns (1 992), Journal of Bade- 
10 riology 174: 6076-6086), 

- das fOr die Pyruvat Carboxylase kodierende Gen pyc (Eikmanns (1992), Journal of Bacteriology 
174:6076-6086), 

• das for die Triosephosphat Isomerase kodierende Gen tpi (Eikmanns (1 992), Journal of Bacte- 
riology 174: 6076-6086), 

15 - das fOr die Homoserin O-Acetyltransferase kodierende Gen metA (EP 1 108 790 A2; DNA-SEQ 
NO. 725), 

- das for die Cystahionin-gamma-Synthase kodierende Gen metB (EP 1 108 790 A2; DNA-SEQ 
NO. 3491), 

- das fOr die Cystahionin-gamma-Lyase kodierende Gen metC (EP 1 1 08 790 A2; DNA-SEQ NO. 
20 3061), 

- das fOr die Serin-Hydroxymethyltransferase kodierende Gen glyA (EP 1 108 790 A2; DNA-SEQ 
NO. 1110). 

- das fur die O-Acetylhomoserin-Sulfhydrylase kodierende Gen metY (EP 1 108 790 A2; DNA- 
SEQ NO. 726), 

25 - das fOr die Methylentetrahydrofolat-Reduktase kodierende Gen metF (EP 1 1 08 790 A2; DNA- 
SEQ NO. 2379), 

- das fOr die Phosphoserin-Aminotransferase kodierende Gen serC (EP 1 108 790 A2; DNA- 
SEQ NO. 928) 

- eines fOr die Phosphoserin-Phosphatase kodierende Gen serB (EP 1 1 08 790 A2; DNA-SEQ 
30 NO. 334, DNA-SEQ NO. 467, DNA-SEQ NO. 2767) 

- das fOr die Serine Acetyl-Transferase kodierende Gen cysE (EP 1 1 08 790 A2; DNA-SEQ NO. 
2818) 
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- das fOr eine Homoserin-Dehydrogenase kodierende Gen hom (EP 1 108 790 A2; DNA-SEQ 
NO. 1306) 

So kann fflrdie Herstellung von schwefelhaltige Feinchemikalien, insbesondere L-Methionin, in 
5 coryneformen Bakterien, vorteilhaft sein, gleichzeitig wenigstens elnes der nachfolgenden Gene 
zu mutieren, insbesondere so, dass die korrespondierenden Proteine, verglichen mit nlcht mu- 
tierten Proteinen, in geringerem Mafie Oder nicht durch einen Stoflwechselmetabol'rten in ihrer 
Aktivitat beeinflusst werden: 

10 - das fQr eine Aspartatkinase kodierende Gen lysC (EP 1 1 08 790 A2; DNA-SEQ NO. 281 ) f 

- das fOr die Pyruvat Carboxylase kodierende Gen pyc (Eikmanns (1992), Journal of Bacteriology 
174:6076-6086). 

- das fOr die Homoserin O-Acetyltransferase kodierende Gen metA (EP 1 1 08 790 A2; DNA-SEQ 
NO. 725), 

15 -das fOrdie Cystahionin-gamma-Synthase kodierende Gen metB (EP 1 108 790 A2; DNA-SEQ 
NO. 3491), 

- das fOrdie Cystahionin-gamma-Lyase kodierende Gen metC (EP 1.108 790 A2; DNA-SEQ NO. 
3061), 

- das fOr die Serin-Hydroxymethyitransferase kodierende Gen glyA (EP 1 1 08 790 A2; DNA-SEQ 
20 NO. 1110), 

- das fOr die O-Acetylhomoserin-Sulfhydrylase kodierende Gen metY (EP 1 108 790 A2; DNA- 
SEQ NO. 726), 

-das fOrdie Methylentetrahydrofolat-Reduktase kodierende Gen metF (EP 1 108 790 A2; DNA- 
SEQ NO. 2379), 

25 - das fOr die Phosphoserin-Aminotransferase kodierende Gen serC (EP 1 108 790 A2; DNA- 
SEQ NO. 928) 

- eines fOrdie Phosphoserin-Phosphatase kodierende Gen serB (EP 1 108 790 A2; DNA-SEQ 
NO. 334, DNA-SEQ NO. 467, DNA-SEQ NO. 2767) 

- das fOr die Serine Acetyl-Transferase kodierende Gen cysE (EP 1 108 790 A2; DNA-SEQ NO. 
30 2818) 

- das fOr eine Homoserin-Dehydrogenase kodierende Gen hom (EP 1 108 790 A2; DNA-SEQ 
NO. 1306) 
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Weiterhin kann es fOr die Produktion von schwefelhaltige Feinchemikalien, insbesondere L- 
Methionin, vorteilhaft sein, zusatzlich zur Expression bzw. VerstSrkung eines der erfindungsge- 
mailen metH-Gene eines oder mehrere der folgenden Gene abzuschwachen, insbesondere 
deren Expression zu verringem, oder auszuschalten: 

5 

- das fQr die Homoserine-Kinase kodierende Gen thrB (EP 1 108 790 A2; DNA-SEQ NO. 3453) 

- das fQr die Threonin Dehydratase kodierende Gen ilvA (EP 1 108 790 A2; DNA-SEQ NO. 
2328) 

- das fOr die Threonin Synthase kodierende Gen thrC (EP 1 108 790 A2; DNA-SEQ NO. 3486) 
10 - das fQr die Meso-Diaminopimelat D-Dehydrogenase kodierende Gen ddh (EP 1 1 08 790 A2; 

DNA-SEQ NO. 3494) 

• das fur die Phosphoenolpyruvat-Carboxykinase kodierende Gen pck (EP 1 108 790 A2; DNA- 
SEQ NO. 3157) 

- das fOr die Glucose-6-Phosphat-6-lsomerase kodierende Gen pgi (EP 1 108 790 A2; DNA- 
15 SEQNO.950) 

- das fQr die Pyruvat-Oxidase kodierende Gen poxB (EP 1 108 790 A2; DNA-SEQ NO. 2873) 

- das fQr die Dihydrodipicolinat Synthase kodiemde Gen dapA(EP 1 1 08 790 A2; DNA-SEQ NO. 
3476) 

- das fQr die Dihydrodipicolinat Reduktase kodiemde Gen dapB (EP 1 108 790 A2; DNA-SEQ 
20 NO. 3477) 

- das fQr die Diaminopicolinat Decarboxylase kodiemde Gen lysA (EP 1 1 08 790 A2; DNA-SEQ 
NO. 3451) 

Weiterhin kann es fQr die Produktion von schwefelhaltige Feinchemikalien, insbesondere L- 
25 Methionin, vorteilhaft sein, zusatzlich zur Expression bzw. VerstSrkung eines der erfindungsge- 
mSBen metH-Gene in Coryneformen Bakterien gleichzeitig wenigstens eines der folgenden Ge- 
ne so zu mutieren, dass die enzymatische Aktivitdt des korrespondierenden Proteins teilweise 
oder vollstSndig verringert wird: 

30 - das fOr die Homoserine-Kinase kodierende Gen thrB (EP 1 1 08 790 A2; DNA-SEQ NO. 3453) 

- das fQr die Threonin Dehydratase kodierende Gen ilvA (EP 1 108 790 A2; DNA-SEQ NO. 
2328) 

- das fQr die Threonin Synthase kodierende Gen thrC (EP 1 108 790 A2; DNA-SEQ NO. 3486) 
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- das fur die Meso-Diaminopimelat D-Dehydrogenase kodierende Gen ddh (EP 1 108 790 A2; 
DNA-SEQ NO. 3494) 

- das fOr die Phosphoenolpyruvat-Carboxykinase kodierende Gen pck (EP 1 108 790 A2; DNA- 
SEQ NO. 3157) 

5 - das fOr die Glucose-6-Phosphat-6-lsomerase kodierende Gen pgi (EP 1 108 790 A2; DNA- 
SEQ NO. 950) 

- das fOr die Pyruvat-Oxidase kodierende Gen poxB (EP 1 108 790 A2; DNA-SEQ NO. 2873) 

- das fOr die Dihydrodipicolinat Synthase kodiernde Gen dapA(EP 1 1 08 790 A2; DNA-SEQ NO. 
3476) 

10 - das fOr die Dihydrodipicolinat Reduktase kodiernde Gen dapB (EP 1 108 790 A2; DNA-SEQ 
NO. 3477) 

- das fOr die Diaminopicolinat Decarboxylase kodiernde Gen lysA (EP 1 108 790 A2; DNA-SEQ 
NO. 3451) 

15 Weiterhin kann es fOr die Produktion von schwefelhaltige Feinchemikalien, insbesondere L- 
Methionin, vorteilhaft sein, neben der Expression bzw. VerstSrkung eines erfindungsgemfiBen 
metH-Gens unerwQnschte Nebenreaktionen auszuschalten, welche beispielsweise die Ausbeute 
an der Feichchemikalie verringem (Nakayama: "Breeding of Amino Acid Producing Microorga- 
nisms", in: Overproduction of Microbial Products, Krumphanz), Sikyta, Vanek (eds.), Academic 

20 Press, London, UK, 1 982). 

Die erfindungsgemSB hergestellten Mikroorganismen kdnnen kontinuierlich oderdiskontinuier- 
lich im batch- Verfahren (Satzkultivierung) Oder im fed batch (Zulaufverfahren) oder repeated fed 
batch Verfahren (repetitives Zulaufverfahren) zur Produktion von schwefelhaltige Feinchemika- 
25 lien, insbesondere L-Methionin, kultiviert werden. Eine Zusammenfassung Qberbekannte Kulti- 
vierungsmethoden ist im Lehrbuch von Chmiel (Bioprozelltechnik 1. EinfOhrung in die Bioverfah- 
renstechnik (Gustav Fischer Veriag, Stuttgart, 1 991 )) oder im Lehrbuch von Stoihas (Bioreakto- 
ren und periphere Einrichtungen (Vieweg Veriag, Braunschweig/Wiesbaden, 1994)) zu finden. 

30 Das zu verwendende Kulturmedium hat in geeigneter Welse den AnsprOchen der jeweiligen 
StSmme zu genugen. Beschreibungen von Kulturmedien verschiedener Mikroorganismen sind 
im Handbuch "Manual of Methods fOr General Bacteriology" der American Society fOr Bacterio- 
logy (Washington D. C, USA, 1981) enthalten. 
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Diese erfindungsgemau einsetzbaren Medien umfassen gewdhnlich eine Oder mehrere Koh- 
lenstoffquellen, Stickstoffquellen, anorganische Salze, Vitamine und/oder Spurenelemente. 

5 Bevorzugte Kohlenstoffquellen sind Zucker, wis Mono-, Dh Oder Polysaccharide. Sehr gute Koh- 
lenstoffquellen sind beispielsweise Glucose, Fructose, Mannose, Galactose, Ribose, Sorbose, 
Ribulose, Lactose, Maltose, Saccharose, Raffinose, Starke Oder Cellulose. Man kann Zucker 
auch Ober komplexe Verblndungen, wie Melassen, oder andere Nebenprodukte der Zucker- 
Raffinierung zu den Medien geben. Es kann auch vorteilhatt sein, Gemische verschledener Koh- 
10 lenstoffquellen zuzugeben. Andere mogliche Kohlenstoffquellen sind Ole und Fette wie z. B. 
Sojadl. Sonnenblumendl. ErdnufJfil und Kokosfett, FettsSuren wie z. B. Palmitinsfiure, Stearin- 
saure oder Linolsaure, Alkohole wie z. B. Glycerin, Methanol oder Ethanol und organische SSu- 
ren wie z. B. EssigsSure oder Milchsdure. 

1 5 Stickstoffquellen sind gewdhnlich organische oder anorganische Stickstoffverbindungen oder 
Materialien, die diese Verbindungen enthalten. Beispielhafte Stickstoffquellen umfassen Ammo- 
nia k-Gas oder Ammoniumsalze, wie Ammoniumsulfat, Ammoniumchlorid, Ammoniumphosphat 
Ammoniumcarbonat oder Ammoniumnitrat, Nitrate, Hamstoff, AminosSuren oder komplexe 
Stickstoffquellen, wie Maisquellwasser, Sojamehl, Sojaprotein, Hefeextrakt, Fleischextrakt und 

20 andere. Die Stickstoffquellen kflnnen einzeln oder als Mischung verwendet werden. 

Anorganische Salzverbindungen, die in den Medien enthalten sein kflnnen, umfassen die Chlo- 
rid-, Phosphor- oder Sulfatsalze von Calcium, Magnesium, Natrium, Kobalt, Molybdan, Kalium, 
Mangan, Zink, Kupfer und Etsen 
25 . 

Als Schwefelquelle fOr die Herstellung von schwefelhaltigen Feinchemikalien, insbesondere von 
Methionin, kdnnen anorganische schwefelhaltige Verbindungen wie beispielsweise Sulfate, Sulfi- 
te, Dithionite, Tetrathionate, Thiosulfate, Sulfide aber auch organische Schwefelverbindungen, 
wie Mercaptane und Thiole, verwendet werden. 

30 

Als Phosphorquelle konnen Phosphorsaure, Kaliumdihydrogenphosphat oder Dikaliumhydro- 
genphosphat oder die entsprechenden Natrium haltigen Salze verwendet werden. 
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Chelatbildner kannen zum Medium gegeben werden, um die Metallionen in Ldsung zu halten. 
Besonders geeignete Chelatbildner umfassen Oihydroxyphenole, wie Catechol oder Protocate- 
chuat, oder organische SSuren, wie CitronensSure. 

5 Die erfindungsgemSB eingesetzten Fermentationsmedien enthalten Qblicherweise auch andenB 
Wachstumsfaktoren, wie Vitamine oder Wachstumsfflrderer, zu denen beispielsweise Biotin, 
Riboflavin, Thiamin, Folsflure, Nikotinsaure, Panthothenat und Pyridoxin gehOren. Wachstums- 
faktoren und Salze stammen hSufig von komplexen Medienkomponenten, wie Hefeextrakt, Me- 
lassen, Maisquellwasser und dergleichen. Dem Kulturmedium konnen Oberdies geeignete Vor- 

1 0 stufen zugesetzt werden. Die genaue Zusammensetzung der Medienverbindungen hflngt stark 
vom jeweiligen Experiment ab und wird for jeden spezifischen Fall individuell entschieden. In- 
formation Ober die Medienoptimierung ist erhdltlich aus dem Lehrbuch "Applied Microbiol. Physi- 
ology, A Practical Approach" (Hrsg. P.M. Rhodes, P.F. Stanbury, IRL Press (1997) S. 53-73, 
ISBN 0 19 963577 3). Wachstumsmedien lassen sich auch von kommeraellen Anbietem bezie- 

1 5 hen, wie Standard 1 (Merck) oder BHI (Brain heart infusion, DIFCO) und dergleichen. 

SSmtliche Medienkomponenten werden, entwederdurch Hitze (20 min bei 1,5 bar und 121°C) 
Oder durch Sterilfiltration, sterilisiert Die Komponenten kdnnen entweder zusammen odemOti- 
genfalls getrennt sterilisiert werden. Sdmtliche Medienkomponenten kdnnen zu Beginn der An- 
20 zucht zugegen sein oder wahtfrei kontinuierlich oder chargenweise hinzugegeben werden. 

Die Temperatur der Kultur liegt normalerweise zwischen 1 5°C und 45°C, vorzugsweise bei 25°C 
bis 40°C und kann wShrend des Experimentes konstant gehalten oder verSndert werden. Der 
pH-Wert des Mediums sollte im Bereich von 5 bis 8,5, vorzugsweise um 7,0 liegen. Der pH-Wert 

25 fOr die Anzucht IdBt sich wShrend der Anzucht durch Zugabe von basische Verbindungen wie 
Natriumhydroxid, Kaliumhydroxid, Ammoniak bzw. Ammoniakwasseroder saure Verbindungen 
wie Phosphorsaure oder Schwefelsflure kontrollieren. Zur Kontrolle der Schaumentwicklung 
kdnnen AntischaummitteJ wie z. B. FettsSurepolyglykolester, eingesetzt werden. Zur Aufrechter- 
haltung der Stabilitat von Plasmiden konnen dem Medium geeignete selektiv wirkende Stoffe, 

30 wie z. B. Antibiotika, hinzugefOgt werden. Um aerobe Bedingungen aufrechtzuerhalten, werden 
Sauerstoff oderSauerstoff haltige Gasmischungen, wiez. B. Umgebungsluft, indie Kultur elnge- 
tragen. Die Temperatur der Kultur liegt normalerweise bei 20 P C bis 45°C. Die Kultur wird solange 
fortgesetzt, bis sich ein Maximum des gewQnschten Produktes gebildet hat. Dieses Ziel wild 
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normalerweise innerhaib von 10 Stunden bis 160 Stunden erreicht 

Die so erhaltenen, insbesondere L-Methionin enthaltenden, FermentationsbrOhen haben Obll- 
cherweise eine Trockenmasse von 7,5 bis 25 Gew.-%. 

Vorteilhaft ist auBerdem auch, wenn die Fermentation zumindest am Ende, insbesondere jedoch 
Ober mindestens 30% der Fermentationsdauer zuckerlimitiert gefahren wird. Das heiBt, dass 
wShrend dieserZeitdie Konzentration an verwertbaremZuckerim Fermentationsmediumauf £0 
bis 3 g/l gehalten, beziehungsweise abgesenkt wird. 

Die Fermentationsbrtihe wird anschlieBend weiterverarbeitet. Je nach Anforderung kann die 
Biomasse ganz Oder teilweise durch Separationsmethoden, wie z. B. Zentrifugation, Filtration, 
Dekantieren oder einer Kombination dieser Methoden aus der FermentationsbrOhe entfemt oder 
vollstdndig in ihr beiassen werden. 

AnschlieBend kann die FermentationsbrOhe mit bekannten Methoden. wie z. B. mrt Hitfe eines 
Rotationsverdampfers, DOnnschichtverdampfers, Fallfilmverdampfers, durch Umkehrosmose, 
oder durch Nanofiltration, eingedickt beziehungsweise aufkonzentriert werden. Diese aufkon- 
zentrierte Fermentationsbruhe kann anschlieBend durch Gefriertrocknung, SprOhtrocknung, 
Spruhgranulation oder durch andenveitige Verfahren aufgearbeitet werden. 

Es ist aber auch mSglich die schwefelhaltigen Feinchemikalien, insbesonder L-Methionin, weiter 
aufzureinigen. Hierzu wird die produkthaltige Brtlhe nach dem Abtrennen der Biomasse einer 
Chromatographic mit einem geeigneten Harz unterworfen, wobei das gewOnschte Produkt oder 
die Verunreinigungen ganz oder teilweise auf dem Chromatographieharz zurQckgehalten wer- 
den. Diese Chromatographieschritte kfinnen nOtigenfalls wiedertiolt werden, wobei die gleichen 
oder andere Chromatographieharze venvendet werden. Der Fachmann ist in der Auswahl der 
geeigneten Chromatographieharze und ihrer wirksamsten Anwendung bewandert Das gereinig- 
te Produkt kann durch Filtration Oder Ultrafiltration konzentriert und bei einer Temperatur aufbe- 
wahrt werden, bei der die Stability des Produktes maximal ist 

Die Identitat und Reinheit der isolierten Verbindung(en) kann durch Techniken des Standes der 
Technik bestimmt werden. Diese umfassen Hochleistungs-FIOssigkeitschromatographie (HPLC), 
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spektroskopische Verfahren, FSrbeverfahren, DQnnschichtchromatographie, NIRS, Enzymtest 
Oder mikrobiologische Tests. Diese Analyseverfahren sind zusammengefattt in: Patek et al. 
(1994) Appl. Environ. Microbiol. 60:133-140; Malakhova etal. (1996)Biotekhnologiya 11 27-32; 
und Schmidt et al. (1 998) Bioprocess Engineer. 1 9:67-70. Ulmann's Encyclopedia of Industrial 
5 Chemistry (1996) Bd. A27, VCH: Weinheim, S. 89-90, S. 521-540, S. 540-547, S. 559-566, 575- 
581 und S. 581-587; Michal, G (1999) Biochemical Pathways: An Atlas of Biochemistry and Mo- 
lecular Biology, John Wiley and Sons; Fallon, A. et al. (1987) Applications of HPLC in Bioche- 
mistry in: Laboratory Techniques in Biochemistry and Molecular Biology, Bd. 17. 

1 0 Die Erfindung wird nun anhand der folgenden nicht-limitierenden Beispiele nfiher beschrieben: 
Belspiel 1: Konstruktion von pCUK5MCS 

ZunSchst wurden Ampicillinresistenz und Replikationsursprung des Vektors pBR322 mit den 
15 Oligonukleotiden p1.3 (SEQ ID NO:53) und p2.3 (SEQ ID NO:54) mit Hilfe der Polymerase- 
Kettenreaktion (PCR) amplifiziert 

p1.3(SEQIDNO:53) 

5 , -CCCGGGATCCGCTAGCGGCGCGCCGGCCGGCCCGGTGTGAAATACCGCACAG-3 i 

20 

p2.3 (SEQ ID NO:54) 

ff-TCTAGACTCGAGCGGCCGCGGCCGGCCTTTAAATTGAAGACGAAAGGGCCTCG-S' 

Neben den zu pBR322 komplementSren Sequenzen, enthait das Oligonukleotid p1.3 (SEQ ID 
25 NO:53) in 5-3' Richtung die Schnittstellen fur die Restriktionsendonukleasen Smal, BamHI, Nhel 
und AscI und das Oligonukleotid p2.3 (SEQ ID NO:54) in 5'-3' Richtung die Schnittstellen fOr 
die Restriktionsendonukleasen Xbal, Xhol, Notl und Dral. Die PCR Reaktion wurde nach Stan- 
dardmethode wie Innis et al. (PCR Protocols. A Guide to Methods and Applications, Academic 
Press (1 990)) mit PfuTurbo Polymerase (Stratagene, La Jolla, USA) durchgefuhrt Das erbalte- 
30 ne DNA Fragment mit einer Grille von ungefahr 2,1 kb wurde mit dem GFX™PCR, DNA and 
Gel Band Purification Kit (Amersham Pharmacia, Freiburg) nach Angaben des Herstellers gerei- 
nigt Die stumpfen Enden des DNA-Fragmentes wurden mit dem Rapid DNA Ligation Kit (Roche 
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Diagnostics, Mannheim) nach Angaben des Herstellers miteinander ligiert und der Ligationsan- , 
satz nach Standardmethoden wie in Sambrook et al. (Molecular Cloning. A Laboratory Manual, 
Cold Spring Harbor, beschrieben(1989)), in kompetente E.coli XL-IBlue (Stratagene, La Jolla, 
USA) transformiert Eine Selektion auf Plasmid tragende Zellen wurde durch das Ausplattieren 
5 auf Ampicillin (SOpg/ml) haltigen LB Agar (Lennox, 1955, Virology, 1:190) enreichL 

Die Plasmid-DNA eines individuellen Klons wurde mit dem Qiaprep Spin Miniprep Kit (Qiagen, 
Hilden) nach Angaben des Herstellers isoliert und Qber Restriktionsverdaus Oberprtft Das so 
ertialtene Plasmid erhait den Namen pCUKL 

10 

Ausgehend vom Plasmid pWLT1 (Liebl et al., 1 992) als Template fOr eine PCR Reaktion wurde 
mit den Oligonukleotiden neol (SEQ ID NO:55) und neo2 (SEQ ID NO:56) eine Kanamycin- 
Resistenzcassette amplifiziert 

15 neol (SEQ ID NO:55): 

S'-GAGATCTAGACCCGGGGATCCGCTAGCGGGCTGCTAAAGGAAGCGGA^ 

neo2 (SEQ ID NO:56): 

5-GAGAGGCGCGCCGCTAGCGTGGGCGAAGAACTCCAGCA-3' 

20 

Neben den zu pWLT1 komplementSren Sequenzen, enthait das Oligonukleotid neol in 5-3' 
Richtung die Schnittstellen fDr die Restriktionsendonukleasen Xbal, Smal, BamHI, Nhel und das 
Oligonukleotid neo2 (SEQ ID NO:56) in 5'-3' Richtung die Schnittstellen fOr die Restriktionsen- 
donukleasen AscI und Nhel. Die PCR Reaktion wurde nach Standardmethode wie Innis et al. 

25 (PCR Protocols. A Guide to Methods and Applications, Academic Press (1 990)) mit PfuTurbo 
Polymerase (Stratagene, La Joila, USA) durchgefOhrt. Das ertialtene DNA Fragment mit einer 
GrolJe von ungefShr 1 ,3 kb wurde mit dem GFX™PCR, DNA and Gel Band Purification Kit (A- 
mersham Pharmacia, Freiburg) nach Angaben des Herstellers gereinigt Das DNA-Fragment 
wurde mit den Restriktionsendonukleasen Xbal und AscI (New England Biolabs, Beverly, USA) 

30 geschnitten und im Anschluft daran emeut mit dem GFX™PCR, DNA and Gel Band Purification 
Kit (Amersham Pharmacia, Freiburg) nach Angaben des Herstellers gereinigt Der Vektor 
pCLiKI wurde ebenfalls mit den Restriktionsendonukleasen Xbal und AscI geschnitten und mit 
alkalischer Phosphatase (Roche Diagnostics, Mannheim) nach Angaben des Herstellers 
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dephosphoryliert Nach Elektrophorese in einem 0,8%igen Agarosegel wurde der linearisierte 
Vektor (ca. 2,1 kb) mit dem GFX™PCR, DNA and Gel Band Purification Kit (Amersham Pharma- 
cia, Freiburg) nach Angaben des Herstellers isolierl Dieses Vektor-Fragment wurde mit Hilfe 
des Rapid DNA Ligation Kit (Roche Diagnostics, Mannheim) nach Angaben des Herstellers mit 
5 dem geschnittenen PCR Fragment ligiert und der Ligationsansatz nach Standardmethoden wie 
in Sambrook et al. (Molecular Cloning. A Laboratory Manual, Cold Spring Harbor, beschrie- 
ben(1 989)), in kompetente E.coli XL-1 Blue (Stratagene, La Jolla, USA) transformiert Eine Selek- 
tion auf Plasmid tragende Zellen wurde durch das Ausplattieren auf Ampicillin (50pg/ml) und 
Kanamycin (20pg/ml) haltigen LB Agar (Lennox, 1955, Virology, 1:190) erreicht 

10 

Die P!asmid-DNA eines individuellen Klons wurde mit dem Qiaprep Spin Miniprep Kit (Qiagen, 
Hilden) nach Angaben des Herstellers isoliert und Ober Restriktionsverdaus QberprOft Das so 
erhaltene Plasmid erhatt den Namen pCUK2. 

1 5 Der Vektor pCLiK2 wurde mit der Restriktionsendonuklease Dral (New England Biolabs, Beverly, 
USA) geschnitten. Nach Elektrophorese in einem 0,8%igen Agarosegel wurde ein ca. 2,3 kb 
groBes Vektorfragment mit dem GFX™PCR, DNA and Gel Band Purification Kit (Amersham 
Pharmacia, Freiburg) nach Angaben des Herstellers isoliert Dieses Vektor-Fragment wurde mit 
Hilfe des Rapid DNA Ligation Kit (Roche Diagnostics, Mannheim) nach Angaben des Herstellers 

20 religiert und der Ligationsansatz nach Standardmethoden wie in Sambrook et al. (Molecular Clo- 
ning. A Laboratory Manual, Cold Spring Harbor, beschrieben (1989)), in kompetente E.coli XL- 
1Blue (Stratagene, La Jolla, USA) transformiert Eine Selektion auf Plasmid tragende Zellen 
wurde durch das Ausplattieren auf Kanamycin (20pg/ml) haltigen LB Agar (Lennox, 1 955, Viro- 
logy, 1:190) erreicht 

25 

Die Plasmid-DNA eines individuellen Klons wurde mit dem Qiaprep Spin Miniprep Kit (Qiagen, 
Hilden) nach Angaben des Herstellers isoliert und Qber Restriktionsverdaus OberprOfl Das so 
erhaltene Plasmid erhSIt den Namen pCLiK3. 

30 Ausgehend vom Plasmid pWLQ2 (Liebl etal., 1992) als Template fOr eine PCR Reaktion wurde 
mit den Oligonukleotiden cg1 ((SEQ ID NO:57) und cg2 (SEQ ID NO:58) der Replikation- 
sursprung pHM1519 amplifiziert 
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cg1 (SEQ ID NO:57): 

5X3AGAGGGCGGCCGCGCAMGTCCCGCTTCGTGAA-3' 

cg2 (SEQ ID NO:58): 
5 5-GAGAGGGCGGCCGCTCAAGTCGGTCAAGCCACGC-3' 

Neben den zu pWLQ2 komplementaren Sequenzen, enthalten die Oligonukleofide cg1 (SEQ ID 
NO:57) und cg2 (SEQ ID NO:58) Schnittstellen fOr die Restriktionsendonuklease Notl. Die PCR 
Reaktion wurde nach Standardmethode wie Innis et al. (PCR Protocols. A Guide to Methods and 
Applications, Academic Press (1990)) mit PfuTurbo Polymerase (Stratagene, La Jolla, USA) 
durchgefOhrt Das erhaltene DNA Fragment mit einer GrOfJe von ungefahr2,7kb wurde mit dem 
GFX™PCR, DNA and Gel Band Purification Kit (Amersham Pharmacia, Freiburg) nach Angaben 
des Herstellers gereinigt Das DNA-Fragment wurde mit der Restriktionsendonuklease Notl 
(New England Biolabs, Beverly, USA) geschnitten und im AnschluB daran emeut mit dem 
GFX™PCR, DNA and Gel Band Purification Kit (Amersham Pharmacia, Freiburg) nach Angaben 
des Herstellers gereinigt Der Vektor pCLiK3 wurde ebenfalls mit der Restriktionsendonuklease 
Notl geschnitten und mit alkalischer Phosphatase (Roche Diagnostics, Mannheim)) nach Anga- 
ben des Herstellers dephosphoryliert Nach Elektrophorese in einem 0,8%igen Agarosegel wur- 
de der linearisierte Vektor (ca. 2,3kb) mit dem GFX™PCR, DNA and Gel Band Purification Kit 
(Amersham Pharmacia, Freiburg) nach Angaben des Herstellers isoliert Dieses Vektor- 
Fragment wurde mit Hilfe des Rapid DNA Ligation Kit (Roche Diagnostics, Mannheim) nach An- 
gaben des Herstellers mit dem geschnittenen PCR Fragment liglert und der Ligationsansatz 
nach Standardmethoden wie in Sambrook et al. (Molecular Cloning. A Laboratory Manual, Cold 
Spring Harbor, beschrieben(1 989)), in kompetente E.coli XL-1 Blue (Stratagene, La Jolla, USA) 
transformiert. Eine Selektion auf Plasmid tragende Zellen wurde durch das Ausplattleren auf 
Kanamycin (20pg/ml) haltigen LB Agar (Lennox, 1955, Virology, 1:190) erreichL 

Die Plasmid-DNA eines individuellen Klons wurde mit dem Qiaprep Spin Miniprep Kit (Qiagen, 
Hilden) nach Angaben des Herstellers isoliert und Ober Restriktionsverdaus OberprOfl Das so 
erhaltene Plasmid erhait den Namen pCLiK5. 

FOr die Erweiterung von pCLikS urn eine .multiple cloning site* (MCS) wurden die beide syntheB- 
schen, weitestgehend komplementaren Oligonukleotide HS445 ((SEQ ID NO:59) und HS446 
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(SEQ ID NO:60), die Schnittstellen fOr die Restriktionsendonukleasen Swal, Xhol, Aatl, Apal, 
Asp718, Mlul, Ndel, Spel, EcoRV, Sail, Clal, BamHI, Xbal und Smal enthalten, durch gemeirv 
sames erhitzen auf 95°C und langsames abkuhlen zu einem doppelsWngigen DNA-Fragment 
vereinigt 

5 

HS445(SEQIDNO:59): 

S'-TCGAATTTAAATCTCGAGAGGCCTGACGTCGGGCCCGGTACCACGCGTCATATGACTAG 

TTCGGACCTAGGGATATCGTCGACATCGATGCTCTTCTGCGTTAATTAACAATTGGGATCC 
TCTAGACCCGGGATTTAAAT-3' 

10 

HS446(SEQIDNO:60): 

S'-GATCATTTAAATCCCGGGTCTAGAGGATCCCAATTGTTAATTAACGCAGAAGAGCATCGA 

TGTCGACGATATCCCTAGGTCCGAACTAGTCATATGACGCGTGGTACCGGGCCCGACGTC 
AGGCCTCTCGAGATTTAAAT-3 1 

Der Vektor pCUKS wurde mit den Restriktionsendonuklease Xhol und BamHI (New England 
Biolabs, Beverly, USA) geschnitten und mit alkalischer Phosphatase (I (Roche Diagnostics, 
Mannheim)) nach Angaben des Herstellers dephosphoryliert Nach Elektrophorese in einem 
0,8%lgen Agarosegel wurde der linearisierte Vektor (ca. 5,0 kb) mit dem GFX™PCR, DNA and 

-20 Gel Band Purification Kit (Amersham Pharmacia, Freiburg) nach Angaben des Herstellers iso- 
liert Dieses Vektor-Fragment wurde mit Hilfe des Rapid DNA Ligation Kit (Roche Diagnostics, 
Mannheim) nach Angaben des Herstellers mit dem synthetischen Doppelstrflngigen DNA- 
Fragment ligiert und der Ligationsansatz nach Standardmethoden wie in Sambrook et al. (Mole- 
cular Cloning. A Laboratory Manual, Cold Spring Harbor, beschrieben(1989)), in kompetente 

25 E.coli XL-1 Blue (Stratagene, La Jolla, USA) transformiert Eine Selektion auf Plasmid tragende 
Zellen wurde durch das Ausplattieren auf Kanamycin (20pg/ml) haWgen LB Agar (Lennox, 1 955, 
Virology, 1:190) erreicht 

Die Plasmid-DNA eines individuellen Klons wurde mit dem Qiaprep Spin Miniprep Kit (Qiagen, 
30 Hilden) nach Angaben des Herstellers isoliert und Ober Restriktionsverdaus OberpriifL Das so 
erhaltene Plasmid erhait den Namen pCLiK5MCS. 

Sequenzierungsreaktionen wurden nach Sanger et al. (1 977) Proceedings of the National Aca- 
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demy of Sciences USA 74:5463-5467 durchgefOhrt Die Sequenziereaktionen wurden mittels 
ABI Prism 377 (PE Applied Biosystems, Weiterstadt) aufgetrennt und ausgewertet 

Das entstandene Plasmid pCLiK5MCS ist als SEQ ID NO: 63 aufgefOhrt 

5 

Beisplel 2: Konstruktlon von pCLIKSMCS Integrativ sacB 

Ausgehend vom Plasmid pK1 9mob (SchSfer et al„ Gene 1 45,69-73(1 994)) als Template fOr eine 
PCR Reaktion wurde mit den Oligonukleotiden BK1732 und BK1733 das Bacillus subtilis sacB 
1 0 Gen (kodierend fOr Levan Sucrase) amplifiziert 

BK1732(SEQIDNO:61): 

5•-GAGAGCGGCCGCCGATCCTTTTTAACCCATCAC-3 , 

15 BK1733(SEQIDNO:62): 

S-AGGAGCGGCCGCCATCGGCATTTTCTTTTGCG-S' 

Neben den zu pEK19mobsac komplementSren Sequenzen, enthalten die pligonukleotlde 
BK1732 und BK1733 Schnittstellen fOr die Restriktionsendonuklease Notl. Die PCR Reaktion 

20 wurde nach Standardmethode wie Innis et al. (PCR Protocols. A Guide to Methods and Applica- 
tions, Academic Press (1990)) mit PfuTurbo Polymerase (Stratagene, La Jolla, USA) durchge- 
fOhrt. Das erhaltene DNA Fragment mit einer GrflBe von ungefahr 1,9 kb wurde mit dem 
GFX™PCR, DNA and Gel Band Purification Kit (Amersham Pharmacia, Freiburg) nach Angaben 
des Herstellers gereinigt Das DNA-Fragment wurde mit der Restriktionsendonuklease Notl 

25 (New England Biolabs, Beverly, USA) geschnitten und im AnschluB daran emeut mit dem 
GFX™PCR, DNA and Gel Band Purification Kit (Amersham Pharmacia, Freiburg) nach Angaben 
des Herstellers gereinigt. 

Der Vektor pCUK5MCS (hergestellt gemSB Beispiel 1) wurde ebenfalls mit der Restriktionsen- 
30 donuklease Notl geschnitten und mit alkalischer Phosphatase (I (Roche Diagnostics, Mann- 
heim)) nach Angaben des Herstellers dephosphoryliert Nach Elektrophorese in einem 0,8%igen 
Agarosegel wurde ein ungefahr 2,4 kb groBes Vektorfragment mit dem GFX^PCR, DNA and 
Gel Band Purification Kit (Amersham Pharmacia, Freiburg) nach Angaben des Herstellers iso- 



WO 03/087386 



PCT/EP03/04010 



36 

liert Dieses Vektor-Fragment wurde mit Hilfe des Rapid DNA Ligation Kit (Roche Diagnostics, 
Mannheim) nach Angaben des Herstellers mit dem geschnittenen PCR Fragment ligiert und der 
Ligationsansatz nach Standardmethoden wie in Sambrook et al. (Molecular Cloning. A Laborato- 
ry Manual, Cold Spring Harbor, beschrieben(1989)) f in kompetente E.coliXL-1Blue (Stratagene, 
5 La Jolla, USA) transformlert Eine Selektion auf Plasmid tragende Zellen wurde durch das Aus- 
plattieren auf Kanamycin (20pg/ml) haltigen LB Agar (Lennox, 1955. Virology, 1:190) erreicht 

Die Plasmid-DNA eines individuellen Klons wurde mit dem Qiaprep Spin Miniprep Kit (Qiagen, 
Hilden) nach Angaben des Herstellers isoliert und Qber Restriktionsverdaus OberprOft Das so 
1 0 erhaltene Plasmid erhait den Namen pCLiKSMCS integrativ sacB. 

Sequenzierungsreaktionen wurden nach Sanger et al. (1977) Proceedings of the National Aca- 
demy of Sciences USA 74:5463-5467 durchgefOhrt Die Sequenzierreaktionen wurden mittels 
ABI Prism 377 (PE Applied Biosystems, Weiterstadt) aufgetrennt und ausgewertet 

15 

Das entstandene Plasmid pCLiK5MCS integrativ sacB 1st als SEQ ID NO: 64 aufgefOhrt 

Weitere Vektoren die zur erfindungsgemSften Expression oderOberprodukfion von metH-Genen 
geeignet sind, kflnnen in analoger Weise herstellt werden. 

20 

In den folgenden Beispielen 3 bis 8 wird die schrittweise Konstruktion eines verbesserten Me- 
thionin-produzierenden Stammes mit der Bezeichnung LU1479 lysC 311ile ET-16 pC Phsdh 
metH_Sc beschrieben. 

25 Belsplel 3: Isolierung des lysC gens aus dem C. glutamicum Stamm LU1479 

Im ersten Schritt der Stammkonstruktion soil ein allelischer Austausch des lysC Wildtypgens, 
kodierend fur das Enzym Aspartatkinase, in C. glutamicum ATCC13032, im folgenden LU1479 
genannt, durchgefOhrt werden. Dabei soli im LysC Gen ein Nukleotidaustausch durchgefOhrt 
30 werden, so dass im resultierenden Protein die Aminosfiure Thr an der Position 31 1 durch die 
Aminosdure He ausgetauscht isL 

Ausgehend von der chromosomalen DNA aus LU1479 als Template fOr eine PCR Reaktion 
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wurde mit den Oligonukleotidprimem SEQ ID NO:65 und SEQ ID NO:66 lysC mit Hilfe des Pfu- , 
Turbo PCR Systems (Stratagene USA) nach Angaben des Herstellers amplifiziert Chromo- 
somale DNA aus C. glutamicum ATCC 13032 wurde nach Tauch et al. (1995) Plasmid 33:168- 
1 79 Oder Eikmanns et al. (1 994) Microbiology 140:1 81 7-1 828 prapariert Das ampfifizierte Frag- 
5 ment wird an seinem 5'-Ende von einem Sail Restriktionsschnitt und an seinem 3 -Ende von 
einem Mlul Restriktionsschnitt flankiert Vor der Klonierung wurde das amplifizierte Fragment 
durch diese beiden Restriktionsenzyme verdaut und mit GFX™PCR, DNA and Gel Band Purifi- 
cation Kit (Amersham Pharmacia, Freiburg) aufgereinigt. 

10 SEQIDN0:65 

S'-GAGAGAGAGACGCGTCCCAGTGGCTGAGACGCATC -3' 

SEQ ID NO:66 

5 , -CTCTCTCTGTCGACGAATTCAATCTTACGGCCTG-3• 

15 

Das erhaltenen Polynukleotid wurde Ober die Sail und Mlul Restriktionsschnitte in pCLIKS MCS 
integrativ SacB (im folgenden pCIS genannt; SEQ ID NO: 64 aus Beispiel 2) kloniert und In 
E.coli XL-1 blue transformiert Eine Selektion auf Plasmid-tragende Zellen wurde durch das 
Ausplattieren auf Kanamycin (20pg/ml)-haltigen LB Agar (Lennox, 1955, Virology, 1:190) er- 

20 reicht Das Plasmid wurden isoliert und durch Sequenzierung die erwartete Nukleotidsequenz 
bestatigt Die Preparation der Plasmid-DNA wurde nach Methoden und mit Materialien der Firma 
Quiagen durchgefOhrt Sequenzierungsreaktionen wurden nach Sanger et al. (1977) Procee- 
dings of the National Academy of Sciences USA 74:5463-5467 durchgefOhrt. Die Sequenzier- 
reaktionen wurden mittels ABI Prism 377 (PE Applied Biosystems, Weiterstadt) aufgetrennt und 

25 ausgewertet Das erhaltene Plasmid pCIS lysC ist als SEQ ID NO:77 aufgefOhrt 

Die Sequenz SEQ ID NO:77 umfasst die folgenden wesentlichen Teilbereiche: 



Position 


Art der Se- 
quenz 


Beschreibung 


155-1420 


CDS 1 ' 


lysC 


1974 - 2765 


CDS 


Kanamycin-Resistenz 
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3032-3892 


CDS 


Replikationsursprung/ 


(complement) 2 * 




E.coli/Plasmid pMB 



Kodierende Sequenz 
auf KomplementSrstrang 



Belsplel 4: Mutagenese des lysC Gens aus C. glutamicum 

5 Die gerichtete Mutagenese des lysC Gens aus C. glutamicum (Beispiel 3) wurde mit dem 
QuickChange Kit (Fa. Stratagene/USA) nach Angaben des Herstellers durchgefQhrt Die Muta- 
genese wurde im Plasmid pCIS lysC, SEQ ID NO:77 durchgefQhrt. FOr den Austausch von 
thr311 nach 311ile mit Hilfe der Quickchange Methode (Stratagene) wurxJen folgende Oligo- 
nukleotidprimer synthetisiert 

10 

SEQ ID NO:67 

S'-CGGCACCACCGACATCATCTTCACCTGCCCTCGTTCCG -3* 
SEQ ID NO:68 

1 5 S'-CGGAACGAGGGCAGGTGAAGATGATGTCGGTGGTGCCG -3' 

Der Einsatz dieser Oligonukleotidprimer in der Quickchange Reaktion fOhrt in dem lysC Gen zu 
einem Austausch des Nukleotids in Position 932 (von C nach T) (vgl. SEQ ID NO:75) und im 
konrespondierenden Enzym zu einem AminosSuresubstitution in Position 311 (Thr>lle) (vgl. 
20 SEQ ID NO:76). Der resultierende AmlnosSureaustausch Thr311lle im lysC Gen wurde nach 
Transformation in E.coli XL1-bIue und PlasmidprSparation durch Sequenzierung bestatigt Das 
Plasmid erhielt die Bezeichnung pCIS lysC thr31 1ile und ist als SEQ ID NO:78 aufgefflhrt 

Die Sequenz SEQ ID NO:78 umfasst die folgenden wesentlichen Teilbereiche: 

25 



Position 


Art der Se- 
quenz 


Beschreibung 


155-1420 


CDS 1 ' 


lysC mutiert 


1974-2765 


CDS 


Kanamycin-Resistenz 



I 
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3032-3892 


CDS 


Replikationsursprung/ 


(complement) 2 * 




E.coli/Plasmid pMB 



Kodierende Sequenz 
auf KomplementSrstrang 



Das Plasmid pCIS lysC thr31 1 i!e wurde in C. glutamicum LU1479 mittels Elektroporation wie bei 
5 Liebl, et al. (1 989) FEMS Microbiology Letters 53:299-303 beschrieben, transformiert Modifikati- 
onen des Protokolls sind in DE-A-1 0046870 beschrieben. Die chromosomale Anordnung des 
lysC-Lokus einzelner Transformanten wurde mit Standardmethoden durch Southemblot und 
Hybridisierung, wie in Sambrook et al. (1989), Molecular Cloning. A Laboratory Manual, Cold 
Spring Harbor, beschrieben, OberprOfL Dadurch wurde sichergestellt, dass es sich bei den 
1 0 Transformanten urn solche handelt, die das transformierte Plasmid durch homologe Rekombina- 
tion am lysC-Lokus integriert haben. Nach Wachstum solcher Kolonien Ober Nacht in Medien, 
die kein Antibiotikum enthielten, wurden die Zellen auf ein Saccharose-CM-Agarmedium (10% 
Saccharose) ausplattiert und bei 30°C fQr 24 Stunden inkubiert 

Da das im Vektor pCIS lysC thr31 1ile enthaltende sacB Gen Saccharose in ein toxisches Pro- 
1 5 dukt umwandelt, kdnnen nur solche Kolonien anwachsen, die das sacB Gen durch einen zwel- 
ten homologen Rekombinationsschrittzwischen dem Wildtyp lysC Gen und dem mutierten Gen 
lysC thr31 1 ile delefiert haben. Wahrend der homologen Rekombination kann entweder das Wild- 
typ Gen oder das mutierte Gen zusammen mit dem sacB Gen deletiert werden. Wenn das sacB 
Gen zusammen mit dem Wildtyp Gen entfemt wird, resultiert eine mutierte Transformante. 

20 

Anwachsende Kolonien wurden gepickt, und auf eine Kanamycin-sensitiven PhSnotyp hin unter- 
sucht. Klone mit deletiertem SacB Gen mOssen glelchzeitg Kanamycin-sensitives Wachstums- 
verhalten zeigen. Solche Kan-sensitiven Klone wurde im einem Schuttelkolben auf ihre Lysin- 
Produktivitat hin untersucht (siehe Beispiel 6). Zum Vergleich wurde der nichtbehandelte Stamm 
25 LU 1 479 angezogen. Klone mit einer gegenOber der Kontrolle erhOhten Lysin-Produ kfion wurden 
selektiert, chromosomale DNA wurde gewonnen und der entsprechende Bereich des lysC Gens 
wurde durch eine PCR-Reaktion amplifiziert und sequenziert Ein solcher Klon mit der Eigen- 
schaft erh6hter Lysin-Synthese und nachgewiesener Mutation in lysC an der Stelle 932 wurde 
mitLU1479lysC311ile bezeichnet). 
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Belsplel 5: Herstellung Ethionin-resistenter C. glutamicum StSmme 

Im zweiten Schritt der Stammkonstruktion wurde der erhaltene Stamm LU1479 lysC 31 1tle (Bei- 
5 spiel 4) behandelt, urn etne Ethionin-Resistenz (Kase, H. Nakayama K.Agr. Biol. Chem. 39 1 53- 
106 1 975 L-methionine production by methionine analog-resistant mutants of Corynebacterium 
glutamicum) zu induzieren: Eine Obemachtkultur in BHI-Medium (Difco) wurde in Citratpuffer 
(50mM pH 5,5) gewaschen und bei 30°C fOr 20 min mit N-Methyl-nitrosoguanidin (10mg/ml in 
50mM Citrat pH5,5) behandelt Nach der Behandlung mit dem chemischen Mutagen N-Methyl- 

1 0 nitrosoguanidin wurden die Zellen gewaschen (Citratpuffer 50mM pH 5,5) und auf ein Medium 
plattiert, das aus folgenden Komponenten, berechnet auf 500ml, zusammengesetzt war 10g 
(NH 4 )2SO4.0.5g KH 2 PO 4 ,0.5g K 2 HPO 4 ,0.125g MgS0 4 .7H 2 0, 21g MOPS, 50mg CaCI 2f 15mg 
Proteokatechuat, 0,5mg Biotin, 1mg Thiamin, 5g/l D,L-Ethionin (Sigma Chemicals Deutschland), 
pH 7,0. AuBerdem enthielt das Medium 0.5ml einer Spurensalzlflsung aus: 10g/l FeS0 4 7H 2 0, 

15 1 g/l MnS0 4 *H 2 0, 0.1 g/l ZnS0 4 *7H20, 0.02g/l CuS0 4 , 0.002g/l NiCI 2 *6H 2 0, Alle Salze wurden in 
0,1 M HCI geldsL Das fertig zusammengesteltte Medium wurde sterilfiltriert und nach Zugabe von 
40ml steriler 50% GlucoselOsung, mit flOssigem sterilem Agar in einer Endkonzentraton von 
1 ,5% Agar versetzt und in Kulturschalen ausgegossen. 

20 Auf Platten mit dem beschriebenen Medium wurden mutagenisierte Zellen aufgebracht und 3-7 
Tage bei 30°C inkubiert. Erhaltene Klone wurden isoliert, mindestens einmal auf dem Selekti- 
onsmedium vereinzelt und dann auf ihre Methionin-Produktivitat in einem SchOttelkolben in Me- 
dium II untersucht (siehe Beispiel 6 

25 Belsplel 6: Herstellung von Methionin mit dem Stamm LU1479 lysC 31 1ile ET-16. 

Die in Beispiel 5 hergestellten SHmme wurden auf einer Agar-Platte mit CM-Medium fOr 2 Tag 
bei 30°C angezogen. 

CM-Agan 

30 1 0,0 g/l D-Glucose, 2,5 g/l NaCI, 2,0 g/l Hamstoff, 1 0,0 g/l Bacto Pepton (Difco), 5,0 
g/l Yeast Extract (Difco), 5,0 g/l Beef Extract (Difco), 22,0 g/l Agar (Difco), autoklaviert (20 min., 
121°C) 
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AnschlielJend wurden die Zellen von der Platte abgekratzt und in Saline resuspendieit FOr die 
Hauptkultur wurden 10 ml Medium II und 0,5 g autoklaviertes CaC0 3 (Riedel de Haen) in einem 
1 00 ml Erlenmeyertolben mit der Zellsuspension bis zu einer OD600nm von 1,5 beimpft und fOr 
5 72h auf einem Orbitalschuttler mit 200 Upm bei 30°C inkubiert 

Medium II: 

40g/l Saccharose 

60g/l Meiasse (auf 1 00% Zuckergehalt berechnet) 

10 10g/l (NH 4 )2S0 4 
0.4g/l MgSO/7H 2 0 
0.6g/l KH 2 P0 4 
0.3mg/l Thiamin*HCI 

1mg/l Biotin (aus einer 1 mg/ml steril filtrierten StammlOsung die mit NH 4 OH auf pH 

1 5 8,0 eingestellt wurde) 

2mg/l FeS0 4 
2mg/l MnS0 4 

mit NH 4 OH auf pH 7,8 eingestellt, autoklaviert (121 °C, 20 min). ZusStzlich wird Vitamin B12 
(Hydroxycobalamin Sigma Chemicals) aus einer Stamml6sung (200 pg/ml, steril filtriert) bis zu 
20 einer Endkonzentration von 1 00 pg/l zugegeben 

Gebildetes Methionin, sowie andere AminosSuren in der KulturbrOhe wurde mit Hilfe der Ami- 
nosauresaure-Bestimmungsmethode von Agilent auf einer Agilent 1100 Series LC System 
HPLC. Eine Derivatisierung vor der SSulentrennung mit Ortho-Phthalaldehyd ertaubte die Quan- 
25 Wizierung der gebildeten Aminosauren. Die Auftrennung des Aminosauregemisch fand auf einer 
Hypersil AA-Sdule (Agilent) statt 

Solche Klone wurden isoliert, deren Methionin-Produktivitat mindestens doppelt so hoch war, wie 
die des Ausgangsstamm LU1479 lysC 311ile. Ein solcher Klon wurde fOr die weiteren Versuche 
30 eingesetzt und bekam die Bezeichnung LU1479 lysC 31 1 ile ET-16. 



Belsplel 7: Klonierung von metH aus Streptomyces coelicolor und Klonierung in das Plasmid pC 
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Phsdh metH_Sc 

a) Chromosomale DNA wurde aus Streptomyces coellcolor Stamm ATCC BAA-471 (von 
der American Type Strain Culture Collection, (ATCC) Atlanta, LISA, unteir der Bestellnummer 

5 BAA-471 D erhaittich) isoliert Chromosomale DNA aus C. glutamicum ATCC 1 3032 wurde nach 
Tauch et al. (1995) Plasmid 33:168-179 oder Eikmanns et aL (1994) Microbiology 140:1817- 
1828 prapariert 

Mit den Oligonukleotldprimer SEQ ID NO:69 und SEQ ID NO:70, der chromosomalen DNA aus 
10 C. glutamicum als Template und Pfu Turbo Polymerase (Fa. Stratagene) wurde mit Hilfe der 
Polymerase-Kettenreaktion (PCR) nach Standardmethoden. wie Innis et al. (1 990) PCR Proto- 
cols. A Guide to Methods and Applications, Academic Press, eln DNA Fragment von ca. 180 
Basenpaaren aus dem nichtkodierenden 5-Bereich (PromotorTegion) der Homoserindehydro- 
genase (HsDH) amplifiziert Das amplifizierte Fragment ist an seinem S'-Ende von einerXhol- 
1 5 Restriktionsschnittstelle und am 3-Ende von einem Qber das Oligo eingefOhrten zu metH aus 
Streptomyces coelicolor homologen Bereich flankiert 

SEQ ID NO:69 

* 5^GAGACTCGAGGGAAGGTGMTCGMTTTCGG-3' 
20 und 

SEQ ID NO:70 

5-GTCCCGGGGAGAACGCACGATTCTCCAAAAATAATCGC-3' 

Das erhaltene DNA Fragment wurde mit dem GFX™PCR, DNA and Gel Band Purification Kit 
25 (Amersham Pharmacia, Freiburg) nach Angaben des Herstellers gereinigt 

b) Ausgehend von der chromosomalen DNA aus Streptomyces coelicolor als Template fOr 
eine PCR Reaktion wurde mit den Oligonukleotidprimern SEQ ID NO:71 und SEQ ID NO:72 ein 
Teil von metH mit Hilfe des GC-RICH PCR Systems (Roche Diagnostics, Mannheim) nach An- 

30 gaben des Herstellers amplifiziert. Das amplifizierte Fragment ist an seinem 5-Ende von einem 
Ober das Oligo eingefOhrten, zur Promotorregion von HsDH aus C. glutamicum homologen 
Bereich flankiert 
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SEQ ID N0.71 

5'-GAATCGTGCGTTCTCCCCGGGAC -3 1 
und 

SEQ ID NO:72 

S'-GTAGTTGACCGAGTTGATCACC -3' 

Das ca. 1 ,4 kb grolle erhaltene DNA Fragment wurde mit dem GFX™PCR, DNA and Gel Band 
Purification Kit (Amersham Pharmacia, Freiburg) nach Angaben des Herstellers gereinigt 

c) In einer weiteren PCR Reaktion wurden die beiden oben ertialtenen Fragmente gemein- 
sam als Template eingesetzt Durch die mit dem Oligonukleotidprimem SEQ ID NO:71 und SEQ 
ID NO:70 eingebrachten, zu dem jeweils anderen Fragment homologen Bereichen, kommtes im 
Zuge der PCR-ReakBon zu einer Anlagerung beider Fragmente aneinander und einer Ver- 
langerung zu einem durchgehenden DNA-Strang durch die eingesetzte Polymerase. Die Stan- 
dardmethode wurde dahingehend modifiziert, dass die verwendeten Oligonukleotidprimer SEQ 
ID NO:69 und SEQ ID NO:72 erst mit Beginn des 2. Zyklus dem Reaktionsansatz zugegeben 
wurden. 

Das amplifizierte DNA Fragment von ungef§hr 1 ,6 kb wurde mit dem GFX™PCR, DNA and Gel 
Band Purification Kit nach Angaben des Herstellers gereinigt. Im Anschluss daran wuide es mit 
den Restriktionsenzymen Xhol und Notl (Roche Diagnostics, Mannheim) gespalten und gelelek- 
trophoretisch aufgetrennt AnschlieBend wurde das ca. 1,6 kb grolle DNA Fragment mit 
GFX™PCR, DNA and Gel Band Purification Kit (Amersham Pharmacia, Freiburg) aus der Aga- 
rose aufgereinigt 

d) Der noch fehlende S'-Bereich von metH wurde ausgehend von der chromosomalen DNA 
aus Streptomyces coelicolor als Template mit den Oligonukleotidprimem SEQ ID NO:73 und 
SEQ ID NO:74 mit Hilfe des GC-RICH PCR Systems (Roche Diagnostics, Mannheim) nach An- 
gaben des Herstellers amplifiziert. Das amplifizierte Fragment ist an seinem 3'-Ende von einer 
Ober das Oligo eingefOhrten EcoRV-Restriktionsschnittstelle flankiert 



SEQ ID NO:73 

S'-CCGGCCTGGAGAAGCTCG-S* 
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und 

SEQ ID NO:74 

S'-GAGAGATATCCCTCAGCGGGCGTTGAAG-S' 

5 Das erhaltene ca. 2,2 kb groBe DNA Fragment wurde mit dem GFX™PCR, DNA and Gel Band 
Purification Kit (Amersham Pharmacia, Freiburg) nach Angaben des Herstellers gereinigt Im 
AnschluB daran wurde es mit den Restriktionsenzymen Notl und EcoRV (Roche Diagnostics, 
Mannheim) gespalten und gelelektrophoretisch aufgetrennt Anschlieliend wurde das ca. 2,2 kb 
groBe DNA Fragment mit GFX™PCR, DNA and Gel Band Purification Kit (Amersham Phar- 
1 0 macia, Freiburg) aus der Agarose aufgereinigt 

e) Der Vektor pClik5MCS SEQ ID NO:63 (Beispiel 1) wurde mit den Restriktionsenzymen 
Xhol und EcoRV (Roche Diagnostics, Mannheim) geschnitten und ein 5 kb groBes Fragment 
nach elektrophoretischer Auftrennung mit GFX^PCR, DNA and Gel Band Purification Kit isoliert 

Das Vektorfragment wurde zusammen mit den beiden geschnittenen und aufgereinigten PCR- 
Fragmenten mit Hilfe des Rapid DNA Ligation Kit (Roche Diagnostics, Mannheim) nach An- 
gaben des Herstellers ligiert und der Ligationsansatz nach Standardmethoden wie In Sambrook 
et al. (Molecular Cloning. A Laboratory Manual, Cold Spring Harbor, beschrieben(1989)), in 
kompetente E.coli XL-1 Biue (Stratagene, La Jolla, USA) transformiert Eine Selektion auf Plas- 
mid-tragende Zellen wurde durch das Ausplattieren auf Kanamycin (20pg/ml) halMgen LB Agar 
(Lennox, 1955, Virology, 1:190) erreichL 

Die Praparation der Plasmid DNA wurde nach Methoden und mit Materialien der Fa. Quiagen 
durchgefOhrt Sequenzierungsreaktionen wurden nach Sanger et al. (1977) Proceedings of the 
National Academy of Sciences USA 74:5463-5467 durchgefOhrt. Die Sequenaerreaktionen wur- 
den mittels ABI Prism 377 (PE Applied Biosystems, Weiterstadt) aufgetrennt und ausgewerteL 

Das entstandene Plasmid pC Phsdh metH_Sc (Streptomyces coelicotor) ist als SEQ ID NO:79 
aufgefOhrt. 

Die Sequenz SEQ ID NO:79 umfasst die folgenden wesentlichen Teilbereiche: 



I 
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Position 


Art der Se- 
quenz 


Beschreibung 


6-155 


Promoter 


HsDH 


156-3752 


CDS 1 ' 


MetH S. coelicolor 


4153-4944 


CDS 


Kanamycin-Resistenz 


5211 -6071 
(complement) 2 * 


CDS 


Replikationsurspoing/ 
E.coli/Plasmid pMB 



Kodierende Sequenz 
auf Komplementarstrang 



Belspiel 8: Transformation des Stammes LU1479 lysC 31 1 ile ET-16 mitdem Plasmid pC Phsdh 
metH_Sc 

Der Stamm LU1479 lysC 31 1 ile ET-1 6 (Beispiel 5) wurde mit dem Plasmid pC Phsdh metH_Sc 
(Beispiel 7) nach der beschriebenen Methode (Liebl, et al. (1989) FEMS Microbiology Letters 
53:299-303) transformiert Die Transformationsmischung wurde auf CM-Platten plattiert, die 
zusatzlich 20mg/l Kanamycin enthielten, urn eine Selektion auf Plasmid-haltlge Zellen zu errei- 
chen. Erhaltene Kan-resistente Klone wurden gepickt und vereinzelt Die Methionin-Produktivitat 
der Klone wurde in einem SchOttelkolbenversuch (s. Beispiel 6) untersucht Der Stamm LU1479 
lysC 31 1ile ET-16 pC Phsdh metH_Sc produzierte im Vergleich zu LU1479 lysC 31 lite ET-16 
signifikant mehr Methionin. 
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PatentansprOche 

1. Verfahren zur fermentativen Herstellung wenigstens einer schwefelhaltigen 
5 Feinchemikalie, welches folgende Schritte umfasst 

a) Fermentation einer die gewOnschte schwefelhaltige Feinchemikalie 
produzierenden coryneformen Bakterienkultur, wobei in den coryneformen 
Bakterien zumindest eine heterologe Nukleotidsequenz exprimiert wird, 
welche fOr ein Protein mit Methionin-Synthase (metF) -Aktivitat kodiert; 
10 b) Anreicherung der schwefelhaltigen Feinchemikalie im Medium oder in den 

Zellen der Bakterien, und 
c) Isolieren der schwefelhaltigen Feinchemikalie. 

2. Verfahren nach Anspmch 1, wobei die schwefelhaltige Feinchemikalie L-Methionln 
15 umfasst 

3. Verfahren nach einem der vorhergehenden Ansprfiche, wobei sich die heterologe metF- 
kodierende Nukleotidsequenz zur metF-kodierenden Sequenz aus Corynebacterium 
glutamicum ATCC 13032 eine Sequenzhomologie vom weniger als 100% aufweist 

20 

4. Verfahren nach Anspruch 3, wobei die metF-kodierende Sequenz aus einem der 
folgenden Organismen abgeleitet ist 



Organimsus 


StammsammluDE 


Corynebacterium diphteriae 


ATCC 14779 


Streptomyces lividans 


ATCC 19844 


Streptomyces coelicolor 


ATCC 10147 


Aquifex aeolicus 


DSM 6858 


Burkholderia cepacia 


ATCC 25416 


Nitrosomonas europaea 


ATCC 19718 


Pseudomonas aeruginosa 


ATCC 17933 


Xylella fastidiosa 


ATCC 35881 


Pseodomonas fluorescens 


ATCC 13525 


Schizosaccharomyces pombe 


ATCC 24969 


Saccharomyces cerevisiae 


ATCC 10751 


Erwinia carotovora 


ATCC 15713 


Klebsiella pneumoniae 


ATCC 700721 


Salmonella typhi 


ATCC 12839 


Salmonella typhimurium 


ATCC 15277 1 


Escherichia coli K12 


ATCC55151 
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Vibno cholerae 


ATCC 


39315 


Haemophilus influenzae 


ATCC 


51907 


cauiooacier crescentus 


ATCC 


19089 


Actinobacillus 


ATCC 


33384 


actinomycetemcomitans 






Neisseria meningitis 


ATCC 


6253 


Rhodobacter capsulatus 


ATCC 


11166 


Campylobacter jejuni 


ATCC 


33560 


Lactococcus lactis 


ATCC 


7962 


Prochlorococcus marinus 


PCC71 


18 


Bacillus stearothermophilus 


ATCC 


12980 



5. Verfahren nach einem der vorhergehenden AnsprQche, wobei die metF-kodierende 
Sequenz eine kodierende Sequenz gemaiX SEQ ID NO:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 
21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51 und 53 Oder eine dazu 

5 homologe Nukleotidsequenz, welche fQr ein Protein mit metF-Aktivitat kodiert, umfasst 

6. Verfahren nach einem der vorhergehenden AnsprQche, wobei die metF-kodierende 
Sequenz fQr ein Protein mit metF-Aktivitat kodiert, wobei das Protein eine 
AminosauresequenzgemSB SEQ ID NO:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 

10 30, 32, 34, 36, 38. 40, 42, 44, 46, 48, 50, 52 und 54 Oder eine dazu homologe 

Aminosauresequenz, welche fQr ein Protein mit metF-Aktivitat steht, umfasst 

7. Verfahren nach einem der vorhergehenden AnsprQche, wobei die kodierende metF- 
Sequenz eine in coryneformen Bakterien replizierbare oder eine stabil in das 

1 5 Chromosom intregrierte DNA Oder eine RNA ist 

8. Verfahren gemaB Anspruch 7, wobei man 

a) einen mit einem Plasmidvektor transformierten Bakterienstamm einsetzt der 
20 wenigstens eine Kopie der kodierenden metF-Sequenz unter der Kontrolle 

regulativer Sequenzen trSgt, oder 

b) einen Stamm einsetzt, in dem die kodierende metF-Sequenz in das Chromosom 
des Bakteriums integriert wurde 

25 9. Verfahren nach einem der vorhergehenden AnsprQche, wobei die kodierende metF- 
Sequenz uberexprimiert wird. 
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10. Verfahren gemSB einem der vorhergehenden AnsprOche, wobei man Bakterien 
fermentiert, in denen zusStzlich wenigstens ein weiteres Gen des Biosyntheseweges der 
gewflnschten schwefelhaltigen Feinchemikalie verstflrkt ist Oder derart mutiert ist dass 

5 es durch Stoffwechselmetabolite nicht in seiner Aktivitat beeinflusst wird. 

11. Verfahren gemSB einem der vorhergehenden AnsprOche, wobei man Bakterien 
fermentiert, in denen wenigstens ein Stoffwechsetweg zumindest teilweise ausgeschaltet 
sind, der die Bildung der gewOnschten schwefelhaltigen Feinchemikalie verringert 

10 

12. Verfahren gem3& einem der vorhergehenden AnsprOche, wobei man coryneforme 
Bakterien fermentiert, in denen gleichzeitig wenigstens eines der Gene, ausgewShK unter 

a) dem fQr eine Aspartatkinase kodierenden Gen lysC, 

15 b) dem fQr die Glycerinaldehyd-3-Phosphat Dehydrogenase kodierenden Gen gap, 

c) dem fOr die 3-Phosphoglycerat Kinase kodierenden Gen pgk, 

d) dem fur die Pyruvat Carboxylase kodierenden Gen pyc, 

e) dem fQr die Triosephosphat Isomerase kodierenden Gen tpi, 

f) dem fQr die Homoserin O-Acetyltransferase kodierenden Gen metA, 
20 g) dem fQr die Cystahionin-gamma-Synthase kodierenden Gen metB, 

h) dem fQr die Cystahionin-gamma-Lyase kodierenden Gen metC, 

i) dem fQr die Serin-Hydroxymethyltransferase kodierenden Gen glyA, 

j) dem fQr die O-Acetyfhomoserin-Sulfhydrylase kodierenden Gen metY, 

k) dem fur das metH Gen, das fOr die Vitamin B1 2 abhangige Methionin-Synthase 

25 kodiert, 

i) dem fur das serC Gen, das fOr die Phosphoserin-Aminotransferase kodiert 

m) dem serB Gen, das fOr die Phosphoserin-Phosphatase kodiert 

n) dem cysE Gen, das fur die Serine Acetyl-Transferase kodiert, und 

o) dem horn Gen, das eine Homoserin-Dehydrogenase kodiert, 



30 



Oberexprimiert oder so mutiert ist, dass die korrespondierenden Proteine, verglichen mit 
nicht mutierten Proteinen, in geringerem MaBe oder nicht durch Stoffwechselmetabolite 
in ihrer Aktivitat beeinflusst werden. 
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13. Verfahren gemSB einem der vorhergehenden AnsprOche, wobei man coryneformen 
Bakterien fermentiert, in denen gleichzeitig wenigstens eines der Gene, ausgewShlt unter 

a) dem fOr die Homoserine-Kinase kodierenden Gen thrB, 

b) dem fOr die Threonin Dehydratase kodierenden Gen ilvA, 

c) dem fQr die Threonin Synthase kodierenden Gen thrC 

d) dem fQr die Meso-Diaminopimelat D-Dehydrogenase kodierenden Gen ddh 

e) dem fQr die Phosphoenolpyruvat-Carboxykinase kodierenden Gen pck, 

f) dem fOr die Glucose-6-Phosphat-6-lsomerase kodierenden Gen pgi, 

g) dem fur die Pyruvat-Oxidase kodierenden Gen poxB, 

h) dem fQr die Dihydrodipicolinat Synthase kodiemden Gen dapA, 

i) dem fQr die Dihydrodipicolinat Reduktase kodiemden Gen dapB; oder 
j) dem fQr die Diaminopicolinat Decarboxylase kodiemden Gen 

durch VerSnderung der Expressionsrate oder durch EinfOhrung einer gezietten Mutation 
abschwScht ist 

14. Verfahren gemSli einem oder mehreren der vorhergehenden AnsprOche, wobei man 
Mikroorganismen der Art Corynebacterium glutamicum einsetzt. 

15. Verfahren zur Herstellung eines L-Methionin haltigen Tierfuttermittel-Additivs aus 
FermentationsbrQhen, welches folgende Schritte umfasst 

a) Kultivierung und Fermentation eines L-Methionin produzierenden 
Mikroorganismus in einem Fermentationsmedium; 

b) Entfemung von Wasser aus der L-Methionin haltigen FermentationsbrOhe; 

c) Entfemung der wShrend der Fermentation gebildeten Biomasse in einer Menge 
von 0 bis 100 Gew.-%; und 

d) Trocknung der gemSft b) und/oder c) erhaltenen FermentationsbrOhe, urn das 
Tierfuttermittel-Additiv in der gewunschten Putver- oder Granulatform zu erhalten. 

16. Verfahren gemaii Anspmch 15, wobei man Mikroorganismen gemaB der Definition in 
einem der AnsprOche 1 bis 14 einsetzt 
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SEQUENZPROTOKOLL 

<110> BASF Aktiengesellschaft 

<120> MetH 

<130> M/43120 

<140> 

<141> 

<160> 79 

<210> 1 
<211> 3597 
<212> DNA 

<213> Streptomyces coelicolor 

<220> 
<221> CDS 
<222> (1)..{3594) 
<223> RSX14254 

<400> 1 

gtg cgt tct ccc egg gac gtc cca cga egg gcg gca ccg ggc aga ggc 48 
Val Arg Ser Pro Arg Asp Val Pro Arg Arg Ala Ala Pro Gly Arg Gly 
1 5 io 15 

aaa gec gac age cgt cgc ate eta ggg age cct ttc atg gee teg teg 96 
Lys Ala Asp Ser Arg Arg He Leu Gly Ser Pro Phe Met Ala Ser Ser 
20 25 30 

cca tec ace ccg ccc gee gac ace cgc ace cgc gtg tec gee etc cga 144 
Pro Ser Thr Pro Pro Ala Asp Thr Arg Thr Arg Val Ser Ala Leu Arg 
35 40 45 

gag gee etc gee ace cgc gtg gtg gtc gee gac ggc gee atg ggc ace 192 
Glu Ala Leu Ala Thr Arg Val Val Val Ala Asp Gly Ala Met Gly Thr 
50 55 60 



240 



288 



atg etc cag gee cag aac ccc acg ctg gac gac ttc cag cag etc gaa 
Met Leu Gin Ala Gin Asn Pro Thr Leu Asp Asp Phe Gin Gin Leu Glu 
65 70 75 80 

ggg tgc aac gag gtc ctg aac etc ace egg ccc gac ate gtc cgc teg 
Gly Cys Asn Glu Val Leu Asn Leu Thr Arg Pro Asp He Val Arg Ser 
85 90 95 

gtg cac gag gag tac ttc gcg gee ggc gtc gac tgc gtc gag ace aac 336 
Val His Glu Glu Tyr Phe Ala Ala Gly Val Asp Cys Val Glu Thr Asn 

105 no 

acc ttc ggc gee aac cac tec gee ctg ggc gag tac gac ate ccc gag 384 
Thr Phe Gly Ala Asn His Ser Ala Leu Gly Glu Tyr Asp He Pro Glu 
^5 120 125 

cgc gtc cac gaa ctg tec gag gee ggc gee cgc gtc gee cgc gag gtc 432 
Arg Val His Glu Leu Ser Glu Ala Gly Ala Arg Val Ala Arg Glu Val 
130 135 140 



gec gac gag ttc ggc gee cgc gac ggc egg cag cgc tgg gtg ctg ggc 



480 
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Ala Asp Glu Phe Gly Ala Arg Asp Gly Arg Gin Arg Trp Val Leu Gly 
145 150 155 160 

tec atg ggc ccc ggc acc aag etc ccc acc etc ggc cac gee ccg tac 528 
Ser Met Gly Pro Gly Thr Lys Leu Pro Thr Leu Gly His Ala Pro Tyr 
165 170 175 

acc gtc ctg cgc gac gee tac cag cgc aac gec gag gga ctg gtc gcg 576 
Thr Val Leu Arg Asp Ala Tyr Gin Arg Asn Ala Glu Gly Leu Val Ala 
180 185 190 

ggc ggc gcg gac gca ctg ctg gtg gag acc acg cag gac ctg etc cag 624 
Gly Gly Ala Asp Ala Leu Leu Val Glu Thr Thr Gin Asp Leu Leu Gin 
195 200 205 

acc aag gec teg gtg etc ggc gec egg cgc gec ctg gac gtc etc ggc 672 
Thr Lys Ala Ser Val Leu Gly Ala Arg Arg Ala Leu Asp Val Leu Gly 
210 215 220 

etc gac ctg ccg etc ate gtg tec gtc acc gtc gag acc acc ggc acc 720 
Leu Asp Leu Pro Leu He Val Ser Val Thr Val Glu Thr Thr Gly Thr 
225 230 235 240 

atg ctg etc ggc teg gag ate ggc_gcc gcg etc acc. gcg ctg gaa ccg 768 
Met Leu Leu Gly Ser Glu He Gly Ala Ala Leu Thr Ala Leu Glu Pro 
245 250 255 

etc ggc ate gac atg ate ggc ctg aac tgc gec acc ggc ccc gec gag 816 
Leu Gly He Asp Met He Gly Leu Asn Cys Ala Thr Gly Pro Ala Glu 
260 265 270 

atg age gag cac ctg cgc tac etc gee egg cac tec cgc ate ccg ctg 864 
Met Ser Glu His Leu Arg Tyr Leu Ala Arg His Ser Arg He Pro Leu 
275 280 285 

acc tgc atg ccc aac gee ggt ctg ccc gtc etc ggc aag gac ggc gec 912 
Thr Cys Met Pro Asn Ala Gly Leu Pro Val Leu Gly Lys Asp Gly Ala 
290 295 300 

cac tac ccg ctg acc gcg ccc gag ctg gee gac gca cac gag acc ttc 960 
His Tyr Pro Leu Thr Ala Pro Glu Leu Ala Asp Ala His Glu Thr Phe 
305 310 315 320 

gtg cgc gag tac ggc ctg tec ctg gtc ggc ggc tgc tgc ggc acc acg 1008 
Val Arg Glu Tyr Gly Leu Ser Leu Val Gly Gly Cys Cys Gly Thr Thr 
325 330 335 



ccc gag cac ctg cgc cag gtc gtc gag egg gtc egg gac acc gec ccc 
Pro Glu His Leu Arg Gin Val Val Glu Arg Val Arg Asp Thr Ala Pro 
340 345 350 



1056 



acc gca cgc gac ccg cgc ccc gag ccc ggc gec gee teg etc tac cag 1104 
Thr Ala Arg Asp Pro Arg Pro Glu Pro Gly Ala Ala Ser Leu Tyr Gin 
355 360 365 

acc gtg ccc ttc cgc cag gac acc tec tac ctg gee ate ggc gag cgc 1152 
Thr Val Pro Phe Arg Gin Asp Thr Ser Tyr Leu Ala He Gly Glu Arg 
370 375 380 

acc aac gee aac ggg tec aag aag ttc cgc gag gee atg ctg gac ggc 1200 
Thr Asn Ala Asn Gly Ser Lys Lys Phe Arg Glu Ala Met Leu Asp Gly 
385 390 395 400 
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cgc tgg gac gac tgc gtc gag atg gcc cgc gac cag ate cgc. gaa ggc 1248 
Arg Trp Asp Asp Cys Val Glu Met Ala Arg Asp Gin lie Arg Glu Gly 
405 410 415 

gcg cac atg etc gac etc tgc gtc gac tac gtc ggc egg gac ggp gtc 1296 
Ala His Met Leu Asp Leu Cys Val Asp Tyr Val Gly Arg Asp Gly Val 
420 425 430 

gcc gac atg gag gaa ctg gcc ggc egg ttc gcc ace gcc tec acg ctg 1344 
Ala Asp Met Glu Glu Leu Ala Gly Arg Phe Ala Thr Ala Ser Thr Leu 
435 440 445 

ccg ate gtc etc gac tec acc gag gtc gac gtc ate egg gcc ggc ctg 1392 
Pro He Val Leu Asp Ser Thr Glu Val Asp Val He Arg Ala Gly Leu 
450 455 460 

gag aag etc ggc ggc cgc gcg gtg ate aac teg gtc aac tac gag gac 1440 
Glu Lys Leu Gly Gly Arg Ala Val He Asn Ser Val Asn Tyr Glu Asp 
465 470 475 * 480 

ggc gcc ggc ccc gag tec egg ttc gcc cgc gtc acg aag etc gcc egg 1488 
Gly Ala Gly Pro Glu Ser Arg Phe Ala Arg Val Thr Lys Leu Ala Arg 
485 490 495 

gag cac ggc gcc gcg ctg ate gcg ctg acc ate gac gag gtg gga cag 1536 
Glu His Gly Ala Ala Leu lie Ala Leu Thr He Asp Glu Val Gly Gin 
500 505 510 

gcc cgc acc gcc gag aag aag gtc gag ate gcc gaa egg etc ate gac 1584 
Ala Arg Thr Ala Glu Lys Lys Val Glu He Ala Glu Arg Leu He Asp 
515 520 525 

gac etc acc ggc aac tgg ggc ate cac gag tec gac ate etc gtc gac 1632 
Asp Leu Thr Gly Asn Trp Gly He His Glu Ser Asp He Leu Val Asp 
530 535 540 

tgc ctg acc ttc acc ate tgc acc ggc cag gag gag tec cgc aag gac 1680 
Cys Leu Thr Phe Thr He Cys Thr Gly Gin Glu Glu Ser Arg Lys Asp 
545 550 555 560 

ggc ctg gcc acc ate gag ggc ate egg gaa etc aag egg cgc cac ccg 1728 
Gly Leu Ala Thr He Glu Gly He Arg Glu Leu Lys Arg Arg His Pro 
565 570 ~ 575 

gac gtg cag acc acg etc ggc ctg teg aac ate tec ttc ggc etc aac 1776 
Asp Val Gin Thr Thr Leu Gly Leu Ser Asn He Ser Phe Gly Leu Asn 
580 585 590 

ccg gcc gcc cgc ate ctg etc aac tec gtc ttc etc gac gaa tgc gtc 1824 
Pro Ala Ala Arg He Leu Leu Asn Ser Val Phe Leu Asp Glu Cys Val 
595 600 605 

aag gcc ggc ctg gac teg gcc ate gtg cac gcg age aag ate ctg ccg 1872 
Lys Ala Gly Leu Asp Ser Ala He Val His Ala Ser Lys He Leu Pro 
610 615 620 

ate gcc cgc ttc gac gag gag cag gtc acc acc gcc etc gac ttg ate 1920 
He Ala Arg Phe Asp Glu Glu Gin Val Thr Thr Ala Leu Asp Leu He 
625 630 635 640 

tac gac cgc cgc cgc gag ggc tac gac ccc ctg caa aag etc atg cag 1968 
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Tyr Asp Arg Arg Arg Glu Gly Tyr Asp Pro Leu Gin Lys Leu Met Gin 
645 650 655 

etc ttc gag ggc gec acc gec aag teg ctg aag gee tec aag gee gag 2016 
Leu Phe Glu Gly Ala Thr Ala Lys Ser Leu Lys Ala Ser Lys Ala Glu 
660 665 670 

gaa ctg gec gec etc ccg ctg gag gag cgc etc aag cgc cgc ate ate 2064 
Glu Leu Ala Ala Leu Pro Leu Glu Glu Arg Leu Lys Arg Arg He He 
675 680 665 

gac ggc gag aag aac ggc etc gaa cag gac etc gac gag gee etc egg 2112 
Asp Gly Glu Lys Asn Gly Leu Glu Gin Asp Leu Asp Glu Ala Leu Arg 
690 695 700 

gag cgc ccg gee etc gag ate gtc aac gac acc ctg etc gac ggt atg 2160 
Glu Arg Pro Ala Leu Glu He Val Asn Asp Thr Leu Leu Asp Gly Met 
705 710 715 720 

aag gtc gtc ggc gag ctg ttc ggc tec ggc cag atg cag ctg ccg ttc 2208 
Lys Val Val Gly Glu Leu Phe Gly Ser Gly Gin Met Gin Leu Pro Phe 
725 730 735 

gtg etc cag tec gee gag gtc atg aag acc gcg gtg gec cac ctg gag 2256 
Val Leu Gin Ser Ala Glu Val Met Lys Thr Ala Val Ala His Leu Glu 
740 745 750 

ccg cac atg gag aag acc gac gac gac ggc aag ggc acg ate gtg ctg 2304 
Pro His Met Glu Lys Thr Asp Asp Asp Gly Lys Gly Thr He Val Leu 
755 760 765 

gee acc gtc cgc ggc gac gtc cac gac ate ggc aag aac etc gtc gac 2352 
Ala Thr Val Arg Gly Asp Val His Asp He Gly Lys Asn Leu Val Asp 
770 775 780 

ate ate ctg tec aac aac ggc tac aac gtc gtc aac etc ggc ate aag 2400 
He He Leu Ser Asn Asn Gly Tyr Asn Val Val Asn Leu Gly He Lys 
785 790 795 * 800 

cag ccc gtc tec gcg ate ctg gaa gcg gee gac gag cac egg gec gac 2448 
Gin Pro Val Ser Ala He Leu Glu Ala Ala Asp Glu His Arg Ala Asp 
805 810 815 

gtc ate ggc atg tec ggc etc etc gtc aag tec acg gtg ate atg aag 2496 
Val He Gly Met Ser Gly Leu Leu Val Lys Ser Thr Val He Met Lys 
820 825 830 

gag aac ctg gag gag ctg aac cag cgc aag ctg gee gee gac tac ccg 2544 
Glu Asn Leu Glu Glu Leu Asn Gin Arg Lys Leu Ala Ala Asp Tyr Pro 
835 840 845 

gtc ate etc ggc ggc gee gec etc acc agg gee tac gtc gaa cag gac 2592 
Val He Leu Gly Gly Ala Ala Leu Thr Arg Ala Tyr Val Glu Gin Asp 
850 855 860 

ctg cac gag ate tac gac ggc gag gtc cgc tac gee cgc gac gee ttc 2640 
Leu His Glu He Tyr Asp Gly Glu Val Arg Tyr Ala Arg Asp Ala Phe 
865 870 875 880 

gag ggc ctg cgc etc atg gac gec etc ate ggc ate aag cgc ggc gtg 2688 
Glu Gly Leu Arg Leu Met Asp Ala Leu He Gly lie Lys Arg Gly Val 
885 890 895 
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ccc ggc gcc aag ctg ccg gag ctg aag cag cgc egg gtg egg gee gee 
Pro Gly Ala Lys Leu Pro Glu Leu Lys Gin Arg Arg Val Arg Ala Ala 
900 905 910 , 



2736 



acc gtc gag ate gac gag cgc ccc gag gaa ggc cac gtc cgc tec gac 
Thr Val Glu He Asp Glu Arg Pro Glu Glu Gly His Val Arg Ser Asp 
915 920 925 



2784 



gtc gcc acc gac aac ccg gtc ccg acc ccg ccc ttc cgc ggc,. acc cgc 2832 
Val Ala Thr Asp ABn Pro Val Pro Thr Pro Pro Phe Arg Gly Thr Arg 
930 935 940 



gtc gtc aag ggc ate cag etc aag gag tac gcc tec tgg etc gac gag 
Val Val Lys Gly He Gin Leu Lys Glu Tyr Ala Ser Trp Leu Asp Glu 
945 950 955 960 



2880 



ggc gcc etc ttc aag ggc cag tgg ggc etc aag cag gcc cgc acc ggc 
Gly Ala Leu Phe Lys Gly Gin Trp Gly Leu Lys Gin Ala Arg Thr Gly 
965 970 975 



2928 



gag gga ccc tec tac gag gaa ctg gtc gag tec gag ggc egg ccg egg 
Glu Gly Pro Ser Tyr Glu Glu Leu Val Glu Ser Glu Gly Arg Pro Arg 
980 985 990 



2976 



ctg cgc ggc ctg etc gac egg etc cag acg gac aac ctt ttg gag gcg 3024 
Leu Arg Gly Leu Leu Asp Arg Leu Gin Thr Asp Asn Leu Leu Glu Ala 
995 1000 1005 

gcc gtg gtc tac ggc tac ttc ccc tgc gtc tec aag gac gac gac ctg 3072 
Ala Val Val Tyr Gly Tyr Phe Pro Cys Val Ser Lys Asp Asp Asp Leu 
1010 1015 1020 



ate gtc etc gac gac gac ggc aac gaa cgc acc cgc ttc acc ttc ccc 
He Val Leu Asp Asp Asp Gly Asn Glu Arg Thr Arg Phe Thr Phe Pro 
1025 1030 1035 1040 



3120 



cgc cag cgc cgc ggc egg cgc ctg tgc ctg gcc gac ttc ttc cgc ccg 
Arg Gin Arg Arg Gly Arg Arg Leu Cys Leu Ala Asp Phe Phe Arg Pro 



1045 



1050 



1055 



3168 



gag gag tec ggc gag acc gac gtg gtc ggc ttc cag gtc gtc acc gtc 
Glu Glu Ser Gly Glu Thr Asp Val Val Gly Phe Gin Val Val Thr Val 
1060 1065 1070 



3216 



ggc tec cgc ate ggc gag gag acg gcc cgc atg ttc gag gcc aac gcc 
Gly Ser Arg He Gly Glu Glu Thr Ala Arg Met Phe Glu Ala Asn Ala 
1075 1080 1085 



3264 



tac cgc gac tat etc gag ctg cac ggc ctg tec gtg cag etc gcc gag 
Tyr Arg Asp Tyr Leu Glu Leu His Gly Leu Ser Val Gin Leu Ala Glu 
1090 1095 1100 



3312 



gcc etc gcc gag tac tgg cac gcg cgc gtg cgc teg gaa etc ggc ttc 
Ala Leu Ala Glu Tyr Trp His Ala Arg Val Arg Ser Glu Leu Gly Phe 
1105 1110 1115 H20 



3360 



gcc ggg gag gac ccg gcc gag atg gag gac atg ttc gcc ctg aag tac 
Ala Gly Glu Asp Pro Ala Glu Met Glu Asp Met Phe Ala Leu Lys Tyr 
1125 H30 H35 



3408 



{ 
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egg ggt gec cgc ttc tec etc ggc tac ggc gec tgc ccc gac ctg gag 3456 
Arg Gly Ala Arg Phe Ser Leu Gly Tyr Oly Ala Cys Pro Asp Leu Glu 
1140 1145 1150 

gac cgc gee aag ate gec gec ctg ctg gag ccc gag cgc ate ggc gtc 3504 
Asp Arg Ala Lya lie Ala Ala Leu Leu Glu Pro Glu Arg lie Gly Val 
1155 1160 1165 

cac eta tec gag gag ttc cag etc cac ccc gag cag tec acc gac gec 3552 
His Leu Ser Glu Glu Phe Gin Leu His Pro Glu Gin Ser Thr Asp Ala 
1170 1175 1180 

ate gtc ate cac cac ccg gag gee aag tac ttc aac gec cgc 3594 
He Val He His His Pro Glu Ala Lys Tyr Phe Asn Ala Arg 
1185 1190 1195 

tga 3 597 



<210> 2 
<211> 1198 
<212> PRT 

<213> Streptomyces coelicolor 
<400> 2 

Val Arg Ser Pro Arg Asp Val Pro Arg Arg Ala Ala Pro Gly Arg Gly 
1 5 10 15 

Lys Ala Asp Ser Arg Arg He Leu Gly Ser Pro Phe Met Ala Ser Ser 
20 25 30 

Pro Ser Thr Pro Pro Ala Asp Thr Arg Thr Arg Val Ser Ala Leu Arg 
35 40 45 

Glu Ala Leu Ala Thr Arg Val Val Val Ala Asp Gly Ala Met Gly Thr 
50 55 60 

Met Leu Gin Ala Gin Asn Pro Thr Leu Asp Asp Phe Gin Gin Leu Glu 
65 70 75 80 

Gly Cys Asn Glu Val Leu Asn Leu Thr Arg Pro Asp He Val Arg Ser 
85 90 95 

Val His Glu Glu Tyr Phe Ala Ala Gly Val Asp Cys Val Glu Thr Asn 
100 105 ~ HO 

Thr Phe Gly Ala Asn His Ser Ala Leu Gly Glu Tyr Asp He Pro Glu 
115 120 125 

Arg Val His Glu Leu Ser Glu Ala Gly Ala Arg Val Ala Arg Glu Val 
130 135 140 

Ala Asp Glu Phe Gly Ala Arg Asp Gly Arg Gin Arg Trp Val Leu Gly 
145 150 155 160 

Ser Met Gly Pro Gly Thr Lys Leu Pro Thr Leu Gly His Ala Pro Tyr 
165 170 175 

Thr Val Leu Arg Asp Ala Tyr Gin Arg Asn Ala Glu Gly Leu Val Ala 
180 185 190 

Gly Gly Ala Asp Ala Leu Leu Val Glu Thr Thr Gin Asp Leu Leu Gin 
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195 200 205 

Thr Lys Ala Ser Val Leu Gly Ala Arg Arg Ala Leu Asp Val Leu Gly 
210 215 220 . ' 

Leu Asp Leu Pro Leu He Val Ser Val Thr Val Glu Thr Thr Gly Thr 
225 230 235 240 

Met Leu Leu Gly Ser Glu He Gly Ala Ala Leu Thr Ala Leu Glu Pro 
245 250 ( 255 

Leu Gly He Asp Met He Gly Leu Asn Cys Ala Thr Gly Pro Ala Glu 
260 265 270 

Met Ser Glu His Leu Arg Tyr Leu Ala Arg His Ser Arg He Pro Leu 
275 280 285 

Thr Cys Met Pro Asn Ala Gly Leu Pro Val Leu Gly Lys Asp Gly Ala 
290 295 300 

His Tyr Pro Leu Thr Ala Pro Glu Leu Ala Asp Ala His Glu Thr Phe 
305 310 315 320 

Val Arg Glu Tyr Gly Leu Ser Leu Val Gly Gly Cys Cys Gly Thr Thr 
325 330 335 

Pro Glu His Leu Arg Gin Val Val Glu Arg Val Arg Asp Thr Ala Pro 
340 345 350 

Thr Ala Arg Abp Pro Arg Pro Glu Pro Gly Ala Ala Ser Leu Tyr Gin 
355 360 365 

Thr Val Pro Phe Arg Gin Asp Thr Ser Tyr Leu Ala He Gly Glu Arg 
370 375 380 

Thr Asn Ala Asn Gly Ser Lys Lys Phe Arg Glu Ala Met Leu Asp Gly 
385 390 395 400 

Arg Trp Asp Asp Cys Val Glu Met Ala Arg Asp Gin He Arg Glu Gly 
405 410 415 

Ala His Met Leu Asp Leu Cys Val Asp Tyr Val Gly Arg Asp Gly Val 
420 425 430 

Ala Asp Met Glu Glu Leu Ala Gly Arg Phe Ala Thr Ala Ser Thr Leu 
435 440 445 

Pro He Val Leu Asp Ser Thr Glu Val Asp Val He Arg Ala Gly Leu 
450 455 460 

Glu Lys Leu Gly Gly Arg Ala Val He Asn Ser Val Asn Tyr Glu Asp 
465 470 475 480 

Gly Ala Gly Pro Glu Ser Arg Phe Ala Arg Val Thr Lys Leu Ala Arg 
485 490 495 

Glu His Gly Ala Ala Leu He Ala Leu Thr He Asp Glu Val Gly Gin 
500 505 510 

Ala Arg Thr Ala Glu Lys Lys Val Glu He Ala Glu Arg Leu He Asp 
515 520 525 
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Asp Leu Thr Gly Aen Trp Gly lie His Glu Ser Asp He Leu Val Asp 
530 535 540 

Cye Leu Thr Phe Thr He Cys Thr Gly Gin Glu Glu Ser Arg Lys Asp 
5 « 550 555 560 

Gly Leu Ala Thr He Glu Gly He Arg Glu Leu Lys Arg Arg His Pro 
565 570 575 

Asp Val Gin Thr Thr Leu Gly Leu Ser Asn He Ser Phe Gly Leu Asn 
580 585 590 

Pro Ala Ala Arg He Leu Leu Asn Ser Val Phe Leu Asp Glu Cys Val 
595 600 605 

Lys Ala Gly Leu Asp Ser Ala He Val His Ala Ser Lys lie Leu Pro 
610 615 620 

He Ala Arg Phe Asp Glu Glu Gin Val Thr Thr Ala Leu Asp Leu He 
625 630 635 640 

Tyr Asp Arg Arg Arg Glu Gly Tyr Asp Pro Leu Gin Lys Leu Met Gin 
645 650 655 

Leu Phe Glu Gly Ala Thr Ala Lys Ser Leu Lys Ala Ser Lys Ala Glu 
660 665 670 

Glu Leu Ala Ala Leu Pro Leu Glu Glu Arg Leu Lys Arg Arg He He 
675 680 ~ 685 

Asp Gly Glu Lys Asn Gly Leu Glu Gin Asp Leu Asp Glu Ala Leu Arg 
690 695 700 

Glu Arg Pro Ala Leu Glu He Val Asn Asp Thr Leu Leu Asp Gly Met 
70S 710 715 720 

Lys Val Val Gly Glu Leu Phe Gly Ser Gly Gin Met Gin Leu Pro Phe 
725 730 735 

Val Leu Gin Ser Ala Glu Val Met Lys Thr Ala Val Ala His Leu Glu 
740 745 750 

Pro His Met Glu Lys Thr Asp Asp Asp Gly Lys Gly Thr He Val Leu 
755 760 765 

Ala Thr Val Arg Gly Asp Val His Asp He Gly Lys Asn Leu Val Asp 
770 775 780 

He He Leu Ser Asn Asn Gly Tyr Asn Val Val Asn Leu Gly He Lys 
785 790 795 800 

Gin Pro Val Ser Ala He Leu Glu Ala Ala Asp Glu His Arg Ala Asp 
805 810 815 

Val He Gly Met Ser Gly Leu Leu Val Lys Ser Thr Val He Met Lys 
820 825 830 

Glu Asn Leu Glu Glu Leu Asn Gin Arg Lys Leu Ala Ala Asp Tyr Pro 
835 840 845 

Val He Leu Gly Gly Ala Ala Leu Thr Arg Ala Tyr Val Glu Gin Asp 
850 855 860 



WO 03/087386 PCT/EP03/04010 

9 

Leu His Glu He Tyr Asp Gly Glu Val Arg Tyr Ala Arg Asp Ala Phe 

86 5 870 875 880 

Glil Gly Leu Arg Leu Met Asp Ala Leu He Gly He Lys Arg Gly Val 
885 890 895 

Pro Gly Ala Lys Leu Pro Glu Leu Lys Gin Arg Arg Val Arg Ala Ala 
900 90S 910 

Thr Val Glu lie Asp Glu Arg Pro Glu Glu Gly His Val Arg Ser Asp 
915 920 925 

Val Ala Thr Asp Asn Pro Val Pro Thr Pro Pro Phe Arg Gly Thr Arg 
930 935 940 

Val Val Lys Gly He Gin Leu Lys Glu Tyr Ala Ser Trp Leu Asp Glu 
945 950 955 960 

Gly Ala Leu Phe Lys Gly Gin Trp Gly Leu Lys Gin Ala Arg Thr Gly 
965 970 975 

Glu Gly Pro Ser Tyr Glu Glu Leu Val Glu Ser Glu Gly Arg Pro Arg 
980 985 990 

Leu Arg Gly Leu Leu Asp Arg Leu Gin Thr Asp Asn Leu Leu Glu Ala 
995 1000 1005 

Ala Val Val Tyr Gly Tyr Phe Pro Cys Val Ser Lys Asp Asp Asp Leu 
1010 1015 1020 

He Val Leu Asp Asp Asp Gly Asn Glu Arg Thr Arg Phe Thr Phe Pro 
1025 1030 1035 1040 

Arg Gin Arg Arg Gly Arg Arg Leu Cys Leu Ala Asp Phe Phe Arg Pro 
1045 1050 * 1055 

Glu Glu Ser Gly Glu Thr Asp Val Val Gly Phe Gin Val Val Thr Val 
1060 1065 1070 

Gly Ser Arg lie Gly Glu Glu Thr Ala Arg Met Phe Glu Ala Asn Ala 
1075 1080 1085 

Tyr Arg Asp Tyr Leu Glu Leu His Gly Leu Ser Val Gin Leu Ala Glu 
1090 1095 lioo 

Ala Leu Ala Glu Tyr Trp His Ala Arg Val Arg Ser Glu Leu Gly Phe 
!105 1110 ins 1120 

Ala Gly Glu Asp Pro Ala Glu Met Glu Asp Met Phe Ala Leu Lys Tyr 
H25 H30 1135 

Arg Gly Ala Arg Phe Ser Leu Gly Tyr Gly Ala Cys Pro Asp Leu Glu 
H40 H45 H50 

Asp Arg Ala LyB He Ala Ala Leu Leu Glu Pro Glu Arg He Gly Val 
1155 H60 H65 

His Leu Ser Glu Glu Phe Gin Leu His Pro Glu Gin Ser Thr Asp Ala 
1170 1175 H80 

He Val He His His Pro Glu Ala Lys Tyr Phe Asn Ala Arg 
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1185 H90 H95 

<210> 3 

<211> 3537 

<212> DNA 

<213> Anabaena sp. 

<220> 

<221> CDS 

<222> (1) . . (3534) 

<223> RAN03790 

<400> 3 

atg act cat cct ttc ctg aaa cgc ctg cac agt ccg gaa ctt ccg gtt 48 

Met Thr His Pro Phe Leu Lys Arg Leu His Ser Pro Glu Leu Pro Val 
15 10 15 



ate gtc ttc gac ggt gca atg gga act aac eta caa acc caa aac etc 
lie Val Phe Asp Gly Ala Met Gly Thr Asn Leu Gin Thr Gin Asn Leu 
20 25 30 



96 



acg get gag gat ttc ggc ggt gtg cag tat gaa ggt tgt aac gaa tac 144 
Thr Ala Glu Asp Phe Gly Gly Val Gin Tyr Glu Gly Cys Asn Glu Tyr 
35 40 45 

eta gtc cac acc aaa ccc gaa get gtc gee aag gtt cac cgc gac ttt 192 
Leu Val His Thr Lys Pro Glu Ala Val Ala Lys Val His Arg Asp Phe 
SO 55 60 

etc get gtg ggt gca gat gtc ate gaa acc gac act ttc ggt gcg aca 240 
Leu Ala Val Gly Ala Asp Val He Glu Thr Asp Thr Phe Gly Ala Thr 
65 70 75 80 

tec att gtt ttg gcg gaa tat gac tta gca gac caa aca tat tac ctg 288 
Ser lie Val Leu Ala Glu Tyr Asp Leu Ala Asp Gin Thr Tyr Tyr Leu 
85 90 95 

aac aag aaa gee gee gaa ctg gcg aaa agt gtc get get gaa ttt tec 336 
Asn Lys Lys Ala Ala Glu Leu Ala Lys Ser Val Ala Ala Glu Phe Ser 
100 105 no 

aca cca gat aaa ccc egg ttt gtt get ggt tec ate ggc ccc aca acc 384 
Thr Pro Asp Lys Pro Arg Phe Val Ala Gly Ser He Gly Pro Thr Thr 
H5 120 125 

aaa ctt ccc acc ttg gga cat ate gac ttt gac act etc aaa act tgc 432 
Lys Leu Pro Thr Leu Gly His He Asp Phe Asp Thr Leu Lys Thr Cys 
130 135 14Q 

ttt get gaa caa gca gaa gcg ctg tta gat ggt ggc gtg gat tta ctt 480 
Phe Ala Glu Gin Ala Glu Ala Leu Leu Asp Gly Gly Val Asp Leu Leu 
145 150 155 160 

ttg gtg gag act tgt caa gat gtg ctg caa ate aaa gcg gcg ctg aat 528 
Leu Val Glu Thr Cys Gin Asp Val Leu Gin lie Lys Ala Ala Leu Asn 
165 170 175 

ggg ata gaa gaa gtc ttt ggc aag aga ggg gaa cgc ata ccc ttg atg 576 
Gly lie Glu Glu Val Phe Gly Lys Arg Gly Glu Arg lie Pro Leu Met 
180 185 190 
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gtg tec gtg aca atg gaa age atg ggg aca atg ttg gtc ggt tec gaa 624 

Val Ser Val Thr Met Glu Ser Met Gly Thr Met Leu Val Gly Ser Glu 
195 200 205 

ate aac gec gtc ctg aca att tta gaa cct ttc cca att gac att etc 

He Asn Ala Val Leu Thr He Leu Glu Pro Phe Pro He Asp Ije Leu 

210 215 



672 



220 



ggt ctg aac tgt gee aca ggc cca gac ttg atg aaa cca cat att aaa 
Gly Leu Asn Cys Ala Thr Gly Pro Asp Leu Met Lys Pro His He Lys 
225 230 235 



720 



240 



816 



864 



tat ttg get gaa cat teg ccg ttt gtg gtt tct tgt att cct aac gcg 768 
Tyr Leu Ala Glu His Ser Pro Phe Val Val Ser Cys He Pro Asn Ala 
24 5 250 255 

ggt tta cca gaa aac gtt ggt ggt caa gca cat tat cgc tta aca cca 
Gly Leu Pro Glu Asn Val Gly Gly Gin Ala His Tyr Arg Leu Thr Pro 
260 265 270 

atg gaa tta cgc atg gcg ttg atg cac ttt gtt gaa gat ttg ggt gtc 
Met Glu Leu Arg Met Ala Leu Met His Phe Val Glu Asp Leu Gly val 
27 5 280 285 

caa gtg ate ggg ggt tgc tgt ggg aca cgt cca gaa cac att caa caa 912 
Gin Val He Gly Gly Cys Cys Gly Thr Arg Pro Glu His He Gin Gin 
290 295 300 

tta gca gaa att gec aag gat tta aag cca aag gtg aga cag cca agt 
Leu Ala Glu He Ala Lys Asp Leu Lys Pro Lys Val Arg Gin Pro Ser 
305 310 315 320 

tta gaa cct gcg get gca tea ata tat agt act caa ccc tac gaa caa 
Leu Glu Pro Ala Ala Ala Ser He Tyr Ser Thr Gin Pro Tyr Glu Gin 
325 330 335 

gat aat tct ttc ttg att gtg ggt gaa cgc etc aac gee agt ggt tec 
Asp Asn Ser Phe Leu He Val Gly Glu Arg Leu Asn Ala Ser Gly Ser 
340 345 350 

III J? C * 9t 9at tt9 ° t9 aat 909 9aa 9 flt tgg 9 ac ^ a "9 gta 1104 
Lys Lys Cys Arg Asp Leu Leu Asn Ala Glu Asp Trp Asp Gly Leu Val 

355 360 365 

tea atg gcg cga teg caa gtc aag gaa ggc gca cat ate ctt gat gtc 1152 
Ser Met Ala Arg Ser Gin Val Lys Glu Gly Ala His He Leu Asp Val 
370 375 380 

lit ? 3t I** 9 ? a °" 930 99t 9tg cg 9 atg cac 9aa eta 
Asn Val Asp Tyr Val Gly Arg Asp Gly Val Arg Asp Met His Glu Leu 

385 390 395 400 

gtt tec cgc att gtg aat aat gtt aca etc ccc tta atg etc gac tec 
Val Ser Arg He Val Asn Asn Val Thr Leu Pro Leu Met Leu Asp Ser 
405 410 4X5 

^ a l" 2? a 339 at9 939 9C9 99t tta aag gt 9 Set ggt ggt aag 1296 
Thr Glu Trp Glu Lys Met Glu Ala Gly Leu Lys Val Ala Gly Gly Lys 
4 20 425 430 

tgt ttg ctg aac tec ace aac tac gaa gat ggg gaa cca cgt ttc tta 1344 
Cys Leu Leu Asn Ser Thr Asn Tyr Glu Asp Gly Glu Pro Arg Phe Leu 



960 



1008 



1056 



1200 



1248 



I 
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435 440 445 

aaa gtg ttg gag ttg gcg aag aaa tat ggc gcg ggt gtt gtt att ggc 1392 
Lys Val Leu Glu Leu Ala Lys Lye Tyr Qly Ala Gly Val Val lie Gly 
450 455 460 

aca att gac gaa gaa ggg atg gcg egg aca gec gag aaa aag ttt caa 1440 
Thr He Asp Glu Glu Gly Met Ala Arg Thr Ala Glu Lys Lys Phe Gin 
465 470 475 480 

att gec cag cgt gec tat cgt caa teg gta gaa tat ggg att ccc ccc 1488 
He Ala Gin Arg Ala Tyr Arg Gin Ser Val Glu Tyr Gly He Pro Pro 
485 490 495 

aca gaa ata ttc ttt gat acc tta get tta cca att tct acc ggg att 1536 
Thr Glu He Phe Phe Asp Thr Leu Ala Leu Pro He Ser Thr Gly He 
500 505 510 

gaa gaa gac egg gaa aat ggc aag gcg aca att gaa tea att age cgt 1584 
Glu Glu Asp Arg Glu Asn Gly Lys Ala Thr He Glu Ser He Ser Arg 
515 520 525 

ate cgt aaa gaa ttg cca ggg tgt cat gtt att tta ggc gtg tea aat 1632 
He Arg Lys Glu Leu Pro Gly Cys His Val He Leu Gly Val Ser Asn 
530 535 540 

ata tec ttt ggc tta aat tea gec teg egg atg gtc tta aac tec gtg 1680 
He Ser Phe Gly Leu Asn Ser Ala Ser Arg Met Val Leu Asn Ser Val 
545 550 555 560 

ttt etc cat gaa gca atg act get ggc atg gat gcg gcg ate gtc agt 1728 
Phe Leu His Glu Ala Met Thr Ala Gly Met Asp Ala Ala He Val Ser 
565 570 575 

get age aag att eta cca ctg teg aag att gaa gag cgt cat caa gaa 1776 
Ala Ser Lys He Leu Pro Leu Ser Lys He Glu Glu Arg His Gin Glu 
580 585 590 

gtc tgc cgc cag tta att tat gac cag cgt aaa ttt gag ggt gat ate 1824 
Val Cys Arg Gin Leu He Tyr Asp Gin Arg Lys Phe Glu Gly Asp He 
595 600 605 

tgc ate tat gac ccc tta aca gaa eta act aaa ttg ttt gag gga gtc 1872 
Cys He Tyr Asp Pro Leu Thr Glu Leu Thr Lys Leu Phe Glu Gly Val 
610 615 620 

acc acc aaa cgt aac aaa ggc gtt gat gaa age tta ccc ate gaa gaa 1920 
Thr Thr Lys Arg Asn Lys Gly Val Asp Glu Ser Leu Pro He Glu Glu 
625 630 635 640 

cga etc aag cgt cac att ate gac ggc gaa cgc att ggt tta gaa gcg 1968 
Arg Leu Lys Arg His He He Asp Gly Glu Arg He Gly Leu Glu Ala 
645 650 655 

caa ctg aca aaa gee tta gaa caa tat cca ccc eta gaa att ate aac 2016 
Gin Leu Thr Lys Ala Leu Glu Gin Tyr Pro Pro Leu Glu He He Asn 
660 665 670 

act ttc eta eta gat ggg atg aaa gta gtc ggg gaa ttg ttc ggt tea 2064 
Thr Phe Leu Leu Asp Gly Met Lys Val Val Gly Glu Leu Phe Gly Ser 
675 680 685 
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gga caa atg cag eta cct ttc gtt tta cag tea gcc gaa ace atg aaa 2112 
Gly Gin Met Gin Leu Pro Phe Val Leu Gin Ser Ala Glu Thr Met Lys 
690 695 700 

gcg gcg gta gcc tac eta gaa ccg ttc atg gaa aaa teg gaa agt ggc 2160 
Ala Ala Val Ala Tyr Leu Glu Pro Phe Met Glu Lys Ser Glu Ser Gly 
705 710 715 720 

aac aat gcc aaa ggt aaa gta att att gcc acc gtg aaa ggc gat gtt 2208 
Asn Asn Ala Lya Gly Lys Val He He . Ala Thr Val Lys Gly Asp Val 
725 730 735 

cac gac att ggt aaa aac eta gta gac att ate ttg tec aac aac ggc 2256 
His Asp lie Gly Lys Asn Leu Val Asp He He Leu Ser Asn Asn Gly 
740 745 750 

tac aag gta att aac ctg gga att aaa cag ccg gtg gaa aat ate ate 2304 
Tyr Lys Val He Asn Leu Gly lie Lys Gin Pro Val Glu Asn He He 
755 760 765 

gag get tac aac caa cac aaa get gat tgt att gcc atg agt ggc ttg 2352 
Glu Ala Tyr Asn Gin His Lys Ala Asp Cys He Ala Met Ser Gly Leu 
770 775 780 

ctg gta aaa tec acc gca ttc atg aaa gaa aat ttg gag gtc ttc aac 2400 
Leu Val Lys Ser Thr Ala Phe Met Lys Glu Asn Leu Glu Val Phe Asn 
785 790 795 800 

gaa aaa ggc att aat gtt cct gta att tta ggt ggt gcg gca tta acc 2448 
Glu Lys Gly He Asn Val Pro Val He Leu Gly Gly Ala Ala Leu Thr 
80S 810 815 

ccg aaa ttc gtg cat aaa gat tgc caa aat acc tac aaa ggt aaa gtc 2496 
Pro Lys Phe Val His Lys Asp Cys Gin Asn Thr Tyr Lys Gly Lys Val 
fl 20 825 830 

att tat ggc aaa gat get ttc tea gac ctg cat ttc atg gat aaa tta 2544 
He Tyr Gly Lys Asp Ala Phe Ser Asp Leu His Phe Met Asp Lys Leu 
835 840 845 

atg cca gcc aaa gcc act ggc aaa tgg gac aat tec tta gga ttc ttg 2592 
Met Pro Ala Lys Ala Thr Gly Lys Trp Asp Asn Ser Leu Gly Phe Leu 
850 855 860 

gat gaa gta gaa acc gag gaa aca gaa cct acc aat cac aaa tec cca 2640 
Asp Glu Val Glu Thr Glu Glu Thr Glu Pro Thr Asn His Lys Ser Pro 
865 870 875 880 

ate ccc agt ccc caa tec cca gtc ccc agt ccc cag tec cca gtc cct 2688 
He Pro Ser Pro Gin Ser Pro Val Pro Ser Pro Gin Ser Pro Val Pro 
885 890 895 



ata gac acc cga cgt tec gaa get gta gcc ata gac att ccc cgt ccc 2736 
He Asp Thr Arg Arg Ser Glu Ala Val Ala He Asp He Pro Arg Pro 
500 905 9io 

aca cca cca ttc tgg gga acg caa tta tta cag cct age gat att tec 2784 
Thr Pro Pro Phe Trp Gly Thr Gin Leu Leu Gin Pro Ser Asp He Ser 
915 920 925 

tta gag gaa ata ttc tgg cac atg gat ttg caa gcc ttg att gcg gga 2832 



[ 
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Leu Glu Glu He Phe Trp His Met Asp Leu Gin Ala Leu He Ala Gly 
930 935 940 

caa tgg caa ttc cgc aaa ccc aaa gaa caa tea aag gaa gaa tat caa 2880 
Gin Trp Gin Phe Arg Lye Pro Lys Glu Gin Ser Lys Glu Glu Tyr Gin 
945 950 955 960 

get ttc ttg aat gag aaa gtg tat cca gtt eta gaa act tgg aaa cag 2928 
Ala Phe Leu Asn Glu Lya Val Tyr Pro Val Leu Glu Thr Trp Lys Gin 
965 970 975 

cgc ate att gca gaa aac ttg tta cat ccc cag gta att tat ggg tat 2976 
Arg He He Ala Glu Asn Leu Leu His Pro Gin Val He Tyr Gly Tyr 
980 985 990 

ttt cct tgt caa tct gag ggt aat act tta tat gtt tac gaa aca aac 3024 
Phe Pro Cys Gin Ser Glu Gly Asn Thr Leu Tyr Val Tyr Glu Thr Asn 
995 1000 1005 

age cca aat gec aca gaa ate act cag ttt gaa ttc ccc cga caa aag 3072 
Ser Pro Asn Ala Thr Glu He Thr Gin Phe Glu Phe Pro Arg Gin Lya 
1010 1015 1020 

tea tea aaa cga tta tgt att gee gat ttc ttt gca ccg aaa gat tea 3120 
Ser Ser Lys Arg Leu Cys He Ala Asp Phe Phe Ala Pro Lys Asp Ser 
1025 1030 1035 1040 



gga ate att gat gtc ttc ccc atg cag gcg gtg act gta ggc gaa att 
Gly He He Asp Val Phe Pro Met Gin Ala Val Thr Val Gly Glu He 
1045 1050 1055 



3168 



get aca gag ttc gcg caa aaa ttg ttt gca aac aat caa tac act gat 3216 
Ala Thr Glu Phe Ala Gin Lys Leu Phe Ala Asn Asn Gin Tyr Thr Asp 
1060 io65 1070 

tat ctg tat ttt cac ggt ttg gcg gtg caa gta gca gaa gee ttg gec 3264 
Tyr Leu Tyr Phe His Gly Leu Ala Val Gin Val Ala Glu Ala Leu Ala 
1° 75 1080 1085 

nf 9 I" ZZ a °? C 9 ?° a9a at ° C9C C9t 939 tta ttc 99t get gaa 3312 
Glu Trp Thr His Ala Arg He Arg Arg Glu Leu Gly Phe Gly Ala Glu 
1090 1095 iioo 

gaa ccg gat aat ate egg gat att ttg gca caa cgc tat cag ggt tec 3360 
Glu Pro Asp Asn He Arg Asp He Leu Ala Gin Arg Tyr Gin Gly Ser 
1105 IHO 1115 U20 

egg tat agt ttt ggc tac cca get tgt ccc aat att caa gac cag ttt 3408 
Arg Tyr Ser Phe Gly Tyr Pro Ala Cys Pro Asn He Gin Asp Gin Phe 
ll 25 1130 H35 

aag cag ctg gat ttg ttg gag act age aga att aac tta tac atg gat 3456 
Lys Gin Leu Asp Leu Leu Glu Thr Ser Arg He Asn Leu Tyr Met Asp 
H40 ii45 H50 

gaa agt gag caa ctt tat cca gaa cag tct acg acg gcg att att act 3504 
Glu Ser Glu Gin Leu Tyr Pro Glu Gin Ser Thr Thr Ala He He Thr 
II 55 H60 lies 

tat cac cca gta get aag tac ttc ace gcg taa 3537 
Tyr His Pro Val Ala Lys Tyr Phe Thr Ala 
1170 1175 
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<210> 4 

<211> 1178 

<212> PRT 

<213> Anabaena sp. 

<400> 4 

Met Thr His Pro Phe Leu Lys Arg Leu His Ser Pro Glu Leu Pro Val 

1 5 10 15 

He Val Phe Asp Gly Ala Met Gly Thr Asn Leu Gin Thr Gin Asn Leu 
20 25 30 

Thr Ala Glu Asp Phe Gly Gly Val Gin Tyr Glu Gly Cys Asn Glu Tvr 
35 40 45 

Leu Val His Thr Lys Pro Glu Ala Val Ala Lys Val His Arg Asp Phe 
50 55 60 

Leu Ala Val Gly Ala Asp Val He Glu Thr Asp Thr Phe Gly Ala Thr 
65 70 75 80 

Ser He Val Leu Ala Glu Tyr Asp Leu Ala Asp Gin Thr Tyr Tyr Leu 
85 90 95 

Asn Lys Lys Ala Ala Glu Leu Ala Lys Ser Val Ala Ala Glu Phe Ser 
100 105 110 

Thr Pro Asp Lys Pro Arg Phe Val Ala Gly Ser He Gly Pro Thr Thr 
115 120 125 

Lys Leu Pro Thr Leu Gly His He Asp Phe Asp Thr Leu Lys Thr Cys 
130 135 140 

Phe Ala Glu Gin Ala Glu Ala Leu Leu Asp Gly Gly Val Asp Leu Leu 
145 150 155 160 

Leu Val Glu Thr Cys Gin Asp Val Leu Gin lie Lys Ala Ala Leu Asn 
165 170 175 

Gly He Glu Glu Val Phe Gly Lys Arg Gly Glu Arg He Pro Leu Met 
180 185 190 

Val Ser Val Thr Met Glu Ser Met Gly Thr Met Leu Val Gly Ser Glu 
195 200 205 

He Asn Ala Val Leu Thr He Leu Glu Pro Phe Pro He Asp He Leu 
210 215 220 

Gly Leu Asn Cys Ala Thr Gly Pro Asp Leu Met Lys Pro His He Lys 
225 2 30 235 240 

Tyr Leu Ala Glu His Ser Pro Phe Val Val Ser Cys He Pro Asn Ala 
245 250 255 

Gly Leu Pro Glu Asn Val Gly Gly Gin Ala His Tyr Arg Leu Thr Pro 
260 265 270 

Met Glu Leu Arg Met Ala Leu Met His Phe Val Glu Asp Leu Gly Val 
275 280 285 
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Gin Val lie Gly Gly Cye Cys Gly Thr Arg Pro Glu His lie Gin Gin 
290 295 300 

Leu Ala Glu He Ala Lys Asp Leu Lye Pro Lys Val Arg Gin Pro Ser 
305 310 315 320 

Leu Glu Pro Ala Ala Ala Ser He Tyr Ser Thr Gin Pro Tyr Glu Gin 
325 330 335 

Asp Asn Ser Phe Leu He Val Gly Glu Arg Leu Asn Ala Ser Gly Ser 
340 345 350 

Lys Lys Cys Arg Asp Leu Leu Asn Ala Glu Asp Trp Asp Gly Leu Val 
355 360 365 

Ser Met Ala Arg Ser Gin Val Lys Glu Gly Ala His He Leu Asp Val 
370 375 380 

Asn Val Asp Tyr Val Gly Arg Asp Gly Val Arg Asp Met His Glu Leu 
385 390 395 400 

Val Ser Arg He Val Asn Asn Val Thr Leu Pro Leu Met Leu Asp Ser 
405 410 415 

Thr Glu Trp Glu Lys Met Glu Ala Gly Leu Lys Val Ala Gly Gly Lys 
420 425 430 

Cys Leu Leu Asn Ser Thr Aen Tyr Glu Asp Gly Glu Pro Arg Phe Leu 
435 440 445 

Lys Val Leu Glu Leu Ala Lys Lys Tyr Gly Ala Gly Val Val He Gly 
450 455 460 

Thr He Asp Glu Glu Gly Met Ala Arg Thr Ala Glu Lys Lys Phe Gin 
465 470 475 480 

He Ala Gin Arg Ala Tyr Arg Gin Ser Val Glu Tyr Gly He Pro Pro 
485 490 495 

Thr Glu He Phe Phe Asp Thr Leu Ala Leu Pro He Ser Thr Gly He 
500 505 510 

Glu Glu Asp Arg Glu Asn Gly Lys Ala Thr He Glu Ser He Ser Arg 
515 520 525 

He Arg Lys Glu Leu Pro Gly Cys His Val He Leu Gly Val Ser Asn 
530 535 540 

He Ser Phe Gly Leu Asn Ser Ala Ser Arg Met Val Leu Asn Ser Val 
545 550 555 560 

Phe Leu His Glu Ala Met Thr Ala Gly Met Asp Ala Ala He Val Ser 
565 570 575 

Ala Ser Lys He Leu Pro Leu Ser Lys He Glu Glu Arg His Gin Glu 
580 585 590 

Val Cys Arg Gin Leu He Tyr Asp Gin Arg Lys Phe Glu Gly Asp He 
595 600 605 

Cys He Tyr Asp Pro Leu Thr Glu Leu Thr Lys Leu Phe Glu Gly Val 
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610 615 620 

Thr Thr Lye Arg Asn Lys Gly Val Asp Glu Ser Leu Pro He Glu Glu 
625 630 635 640 

Arg Leu Lys Arg His He He Asp Gly Glu Arg He Gly Leu Glu Ala 
645 650 655 

Gin Leu Thr Lya Ala Leu Glu Gin Tyr Pro Pro Leu Glu He He Asn 
660 665 670 

Thr Phe Leu Leu Asp Gly Met Lys Val Val Gly Glu Leu Phe Gly Ser 
675 680 685 

Gly Gin Met Gin Leu Pro Phe Val Leu Gin Ser Ala Glu Thr Met Lys 
690 695 . 700 

Ala Ala Val Ala Tyr Leu Glu Pro Phe Met Glu Lys Ser Glu Ser Gly 
705 710 715 720 

Asn Asn Ala Lys Gly Lys Val He He Ala Thr Val Lys Gly Asp Val 
725 730 * 735 

His Asp He Gly Lys Asn Leu Val Asp He He Leu Ser Asn Asn Gly 
740 745 750 

Tyr Lys Val He Asn Leu Gly lie Lys Gin Pro Val Glu Asn lie He 
755 760 765 

Glu Ala Tyr Asn Gin His Lys Ala Asp Cys He Ala Met Ser Gly Leu 
770 775 780 

Leu Val Lys Ser Thr Ala Phe Met Lys Glu Asn Leu Glu Val Phe Asn 
785 790 795 BOO 

Glu Lys Gly He Asn Val Pro Val He Leu Gly Gly Ala Ala Leu Thr 
805 810 815 

Pro Lys Phe Val His Lys Asp Cys Gin Asn Thr Tyr Lys Gly Lys Val 
820 825 830 

He Tyr Gly Lys Asp Ala Phe Ser Asp Leu His Phe Met Asp Lys Leu 
835 840 845 

Met Pro Ala Lys Ala Thr Gly Lys Trp Asp Asn Ser Leu Gly Phe Leu 
850 855 860 

Asp Glu Val Glu Thr Glu Glu Thr Glu Pro Thr Asn His Lys Ser Pro 
865 870 875 " 880 

He Pro Ser Pro Gin Ser Pro Val Pro Ser Pro Gin Ser Pro Val Pro 
885 890 895 

He Asp Thr Arg Arg Ser Glu Ala Val Ala He Asp He Pro Arg Pro 
900 905 910 

Thr Pro Pro Phe Trp Gly Thr Gin Leu Leu Gin Pro Ser Asp He Ser 
915 920 925 

Leu Glu Glu He Phe Trp His Met Asp Leu Gin Ala Leu He Ala Gly 
930 935 940 
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Gin Trp Gin Phe Arg Lys Pro Lys Glu Gin Ser Lys Glu Glu Tyr Gin 
945 950 955 . 960 

Ala Phe Leu Asn Glu Lys Val Tyr Pro Val Leu Glu Thr Trp Lys Gin 
965 970 975 

Arg He He Ala Glu Asn Leu Leu His Pro Gin Val He Tyr Gly Tyr 
980 985 990 

Phe Pro Cys Gin Ser Glu Gly Asn Thr Leu Tyr Val Tyr Glu Thr Asn 
955 1000 1005 

Ser Pro Asn Ala Thr Glu He Thr Gin Phe Glu Phe Pro Arg Gin Lys 
101 ° 1015 1020 

Ser Ser Lys Arg Leu Cys He Ala Asp Phe Phe Ala Pro Lys Asp Ser 
1025 1030 1035 1040 

Gly He He Asp Val Phe Pro Met Gin Ala Val Thr Val Gly Glu He 
1045 1050 1055 

Ala Thr Glu Phe Ala Gin Lys Leu Phe Ala Asn Asn Gin Tyr Thr Asp 
10*0 1065 1070 

Tyr Leu Tyr Phe His Gly Leu Ala Val Gin Val Ala Glu Ala Leu Ala 
1075 1080 1085 

Glu Trp Thr His Ala Arg lie Arg Arg Glu Leu Gly Phe Gly Ala Glu 
1090 1095 iioo 

Glu Pro Asp Asn He Arg Asp He Leu Ala Gin Arg Tyr Gin Gly Ser 
1105 mo ins 1120 

Arg Tyr Ser Phe Gly Tyr Pro Ala Cys Pro Asn He Gin Asp Gin Phe 
H25 H30 

Lys Gin Leu Asp Leu Leu Glu Thr Ser Arg He Asn Leu Tyr Met Asp 
1140 1145 H50 

Glu Ser Glu Gin Leu Tyr Pro Glu Gin Ser Thr Thr Ala He He Thr 
U 55 1160 U65 

Tyr HiB Pro Val Ala Lys Tyr Phe Thr Ala 
1170 ii75 

<210> 5 
<211> 3588 
<212> DNA 

<213> Synechocystis sp. 

<220> 
<221> CDS 
<222> (1) . . (3585) 
<223> RCY35965 

<400> 5 

atg aaa agt get ttt tta gac cgt ate cac agt ccc gat cgc ccg gta 48 

Met Lys Ser Ala Phe Leu Asp Arg He His Ser Pro Asp Arg Pro Val 
1 5 10 15 



gtc ttt gac ggg get atg ggt aca aac ctg cag gta cag aac eta 96 
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Leu Val Phe Asp Qly Ala Met Gly Thr Asn Leu Gin Val Gin Asn Leu 
20 25 30. 

acg gcg gcg gat ttt ggt ggg gcg gaa tac gaa ggt tgc. aat paa tat 144 
Thr 1 Ala Ala Asp Phe Gly Gly Ala Glu Tyr Glu Gly Cys Asn Glu Tyr 
35 40 45 

tta gtc cat acc aag cca gag gcc gtg get acg gtg cat cgt get ttt 192 
Leu Val His Thr Lys Pro Glu Ala Val Ala Thr Val His Arg Ala Phe 
50 55 60 

tac gaa gcg ggg gee gat gtc gtg gaa acg gat act ttt ggg gga acg 240 
Tyr Glu Ala Gly Ala Asp Val Val Glu Thr Asp Thr Phe Gly Gly Thr 
65 70 75 80 

ccc ctg gtg ctg gcg gag tac gat tta gca gac caa agt tat tac tta 28B 
Pro Leu Val Leu Ala Glu Tyr Asp Leu Ala Asp Gin Ser Tyr Tyr Leu 
85 90 * 95 

aat aaa gca gcg gcg gag ttg gcc aag gcg gta gca gcg gaa ttt tct 336 
Asn Lys Ala Ala Ala Glu Leu Ala Lys Ala Val Ala Ala Glu Phe Ser 
100 105 110 



acc cca gaa aag cct cga ttc gtg gcc ggc tec atg gga cca ggc acc 
Thr Pro Glu Lys Pro Arg Phe Val Ala Gly Ser Met Gly Pro Gly Thr 
115 120 125 



384 



aag eta ccc acc eta ggt cat gtg gac tac gat agt etc aag gat gcc 432 
Lys Leu Pro Thr Leu Gly His Val Asp Tyr Asp Ser Leu Lys ' Asp Ala 
130 135 140 

tat gtg gtt cag gtg egg ggt tta tac gat ggc gga gtg gat tta ttg 480 
Tyr Val Val Gin Val Arg Gly Leu Tyr Asp Gly Gly Val Asp Leu Leu 
"5 150. 155 160 

eta gtg gaa acc tgc cag gat gtg ctg caa att aaa gcg gcc ttg aac 526 
Leu Val Glu Thr Cys Gin Asp Val Leu Gin He Lys Ala Ala Leu Asn 
165 170 175 

gcc att gaa cag gtc ttt gcc gaa aaa ggc gat cgc eta ccg ttg atg 576 
Ala He Glu Gin Val Phe Ala Glu Lys Gly Asp Arg Leu Pro Leu Met 
160 185 " 190 

gtg tea gta acc atg gaa acc atg ggg acc atg ctg gtg ggt acg gag 624 
Val Ser Val Thr Met Glu Thr Met Gly Thr Met Leu Val Gly Thr Glu 
155 200 205 

atg gcg gcg gcc ctg gcc att ttg gag ccc tat ccc ate gat att ttg 672 
Met Ala Ala Ala Leu Ala He Leu Glu Pro Tyr Pro He Asp He Leu 
210 215 220 

ggg eta aac tgc gcc acc ggg cca gat ttg atg aag gaa cac gtt aaa 720 
Gly Leu Asn Cys Ala Thr Gly Pro Asp Leu Met Lys Glu His Val Lys 
225 230 235 240 



tat ctt tec gaa cat tec ccc ttt gtg gtg tec tgt att ccc aat get 

Tyr Leu Ser Glu His Ser Pro Phe Val Val Ser Cys He Pro Asn Ala 
245 250 255 

ggt ttg cca gaa aac gtt ggc ggt caa get ttt tat cgc etc acc ccg 

Gly Leu Pro Glu Asn Val Gly Gly Gin Ala Phe Tyr Arg Leu Thr Pro 
260 265 270 



768 



816 
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atg gaa ctg caa atg tec ctg atg cac ttc ate gaa gac ctg gga gta 864 
Met Glu Leu Gin Met Ser Leu Met His Phe lie Glu Asp Leu Gly Val 
275 280 285 

cag gta att ggt ggt tgt tgt ggc act aga ccc gat cac ate aag gec 912 
Gin Val He Gly Gly Cys Cys Gly Thr Arg Pro Asp His He Lys Ala 
290 295 300 

ct 9 gcg gat att gee aag gat etc cag ccc aaa caa cgc caa cct cac 960 
Leu Ala Asp He Ala Lys Asp Leu Gin Pro Lys Gin Arg Gin Pro His 
305 310 315 320 

tac gaa ccc age gee get tec att tat tec ace caa ace tac gec caa 1008 
Tyr Glu Pro Ser Ala Ala Ser He Tyr Ser Thr Gin Thr Tyr Ala Gin 
325 330 335 

gaa aat tct ttt tta ate att ggc gaa egg etc aat gec agt ggc teg 1056 
Glu Asn Ser Phe Leu He He Gly Glu Arg Leu Asn Ala Ser Gly Ser 
340 345 350 

aaa aaa tgt cga gat ctg etc aat get gaa gat tgg gac age eta gtt 1104 
Lys Lys Cys Arg Asp Leu Leu Asn Ala Glu Asp Trp Asp Ser Leu Val 
355 360 365 

tec ctg get aaa tec caa gtc aag gaa gga gec caa ate ctt gac gtc 1152 
Ser Leu Ala Lys Ser Gin Val Lys Glu Gly Ala Gin He Leu Asp Val 
370 375 380 

aac gtg gat tac gtt ggt cga gat ggg gta agg gac atg aaa gaa tta 1200 
Asn Val Asp Tyr Val Gly Arg Asp Gly Val Arg Asp Met Lys Glu Leu 
385 390 395 400 

get tec cga eta gtc aat aat gtc ace ctg ccg ttg atg ttg gac tec 1248 
Ala Ser Arg Leu Val Asn Asn Val Thr Leu Pro Leu Met Leu Asp Ser 
405 410 415 

ace gaa tgg caa aaa atg gag gcg ggt tta aaa gtt gca ggg gga aaa 1296 
Thr Glu Trp Gin Lys Met Glu Ala Gly Leu Lys Val Ala Gly Gly Lys 
420 425 430 

tgt att etc aat tec ace aac tac gaa gac ggg gaa gaa egg ttt tat 1344 
Cys He Leu Asn Ser Thr Asn Tyr Glu Asp Gly Glu Glu Arg Phe Tyr 
435 440 445 

aaa gtg tta gaa att gec aaa gaa tat gga get ggt att gtc att ggc 1392 
Lys Val Leu Glu He Ala Lys Glu Tyr Gly Ala Gly He Val He Gly 
450 455 460 

ace ate gat gaa gat ggc atg gga cgc act gca gat aaa aaa ttt gag 1440 
Thr He Asp Glu Asp Gly Met Gly Arg Thr Ala Asp Lys Lys Phe Glu 
465 470 475 480 

att gee aaa egg gec tac gaa gcg gcg ate gec ttt ggc att ccg gec 14 88 
He Ala Lys Arg Ala Tyr Glu Ala Ala He Ala Phe Gly He Pro Ala 
485 490 * 495 

aca gaa att ttc ttt gat cct tta get ctg cct att tec ace ggc att 1536 
Thr Glu He Phe Phe Asp Pro Leu Ala Leu Pro He Ser Thr Gly He 
500 505 510 

gaa gaa gac agg gag aac ggt aaa gee ace gtg gat get ate cgc aga 1584 
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Glu Glu Asp Arg Glu Asn Gly Lys Ala Thr Val Asp Ala He Arg Arg 
515 520 525 

att cgc cag gaa ttg ccc gat tgt cat att ttg ttg ggg gtt £ct aac 1632 
He Arg Gin Glu Leu Pro Asp Cys HiB He Leu Leu Gly Val Ser Asn 
530 535 540 

gtt tec ttt ggc ttg aat ccc gec get cgc cag gta etc aat tec ate 1680 
Val Ser Phe Gly Leu Asn Pro Ala Ala Arg Gin Val Leu Asn Ser He 
545 550 555 , 560 

ttt etc cac gaa tgt atg cag gtg ggc atg gat gcg gee att gtc agt 1728 
Phe Leu His Glu Cys Met Gin Val Gly Met Asp Ala Ala He Val Ser 
565 570 575 

gee aat aag att tta ccc ctg gca aaa att gac cca gaa caa caa caa 1776 
Ala Asn Lys He Leu Pro Leu Ala Lys He Asp Pro Glu Gin Gin Gin 
580 585 590 

gtc tgt eta gat tta ate tat gac cgc egg gaa ttt gaa gga gag cgc 1824 
Val Cys Leu Asp Leu He Tyr Asp Arg Arg Glu Phe Glu Gly Glu Arg 
595 600 605 

tgt aca tat gac ccg tta ace aaa etc ace act tta ttt gaa ggt aaa 1872 
Cys Thr Tyr Asp Pro Leu Thr Lys Leu Thr Thr Leu Phe Glu Gly Lys 
610 615 620 

acc acc aaa egg gat aaa tec ggt gat gee aat tta ccg gtg , gaa gaa 1920 
Thr Thr Lys Arg Asp Lys Ser Gly Asp Ala Asn Leu Pro Val Glu Glu 
625 630 635 640 

aga tta aaa cgc cac ate att gat ggg gaa aga ttg ggc tta gaa gag 1968 
Arg Leu Lys Arg His He He Asp Gly Glu Arg Leu Gly Leu Glu Glu 
645 650 * * 655 



gec etc aat gaa get tta aaa ctt tac get ccc tta gat ate att aac 
Ala Leu Asn Glu Ala Leu Lys Leu Tyr Ala Pro Leu Asp He He Asn 
660 665 670 



2016 



ate tat ttg ttg gat ggc atg aaa gtg gtg ggg gaa eta ttt ggt tec 2064 
He Tyr Leu Leu Asp Gly Met Lys Val Val Gly Glu Leu Phe Gly Ser 
675 680 685 

ggg caa atg cag ttg ccc ttt gtg ttg cag teg gee caa acc atg aaa 2112 
Gly Gin Met Gin Leu Pro Phe Val Leu Gin Ser Ala Gin Thr Met LyB 
690 695 700 

gcg gcg gtg get ttt tta gaa ccc cat atg gat aag gat gat tec gec 2160 
Ala Ala Val Ala Phe Leu Glu Pro His Met Asp Lys Asp Asp Ser Ala 
705 710 715 720 

gac aat get aag ggt act ttt tta att gee act gtt aag ggg gat gtc 2208 
Asp Asn Ala Lys Gly Thr Phe Leu He Ala Thr Val LyB Gly Asp Val 
725 730 735 

cat gat att ggc aaa aac tta gtg gat att ate ctt tec aac aat ggc 2256 
His Asp He Gly Lys Asn Leu Val Asp He He Leu Ser Asn Asn Gly 
740 745 750 

tat cga gtg gtc aac eta ggc att aaa cag cca gtg gaa aat att ate 2304 
Tyr Arg Val Val Asn Leu Gly lie Lys Gin Pro Val Glu Asn He He 
755 760 765 
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gaa gcc tac aaa aaa cac agg ccc gat tgc att gcc atg agt ggt ttg 2352 
Glu Ala Tyr Lys Lys His Arg Pro Asp Cys He Ala Met Ser Gly Leu 
770 775 780 

ttg gtc aaa tea act get ttt atg aag gaa aat tta gaa gtt ttc aac 2400 
Leu Val Lys Ser Thr Ala Phe Met Lys Glu Asn Leu Glu Val Phe Asn 
785 790 795 800 

caa gag ggc att act gtt ccc gtc att ctt ggt ggt get get tta acg 2448 
Gin Glu Gly He Thr Val Pro Val He Leu Gly Gly Ala Ala Leu Thr 
805 810 815 

cct aaa ttt gtt cac cag gac tgc caa aat ace tac aaa ggc caa gta 2496 
Pro Lys Phe Val His Gin Asp Cys Gin Asn Thr Tyr Lys Gly Gin Val 
820 825 830 

att tac ggc aaa gat gcg ttc gcc gat tta cat ttc atg gat aag eta 2544 
He Tyr Gly Lys Asp Ala Phe Ala Asp Leu His Phe Met Asp Lys Leu 
835 840 845 

atg ccc get aaa aat age cac aat tgg gat gat ttc cag ggc ttt tta 2592 
Met Pro Ala Lys Asn Ser His Asn Trp Asp Asp Phe Gin Gly Phe Leu 
850 855 860 

ggg gaa tat gca acg gaa aat ggc cat aat gtg ace act gat gat ggt 2640 
Gly Glu Tyr Ala Thr Glu Asn Gly His Asn Val Thr Thr Asp Asp Gly 
865 870 875 880 

get aaa act aat ttt ggc att gaa gaa gaa aaa tta att gac get agt 2688 
Ala Lys Thr Asn Phe Gly He Glu Glu Glu Lys Leu He Asp Ala Ser 
885 890 895 

gag cag tct agg gag ccg gag gta att gat act gtt cgt tct gaa gcg 2736 
Glu Gin Ser Arg Glu Pro Glu Val He Asp Thr Val Arg Ser Glu Ala 
900 905 910 

gtg gac cct gat eta gaa aga cct gtg cca cct ttt tgg ggc act aaa 2784 
Val Asp Pro Asp Leu Glu Arg Pro Val Pro Pro Phe Trp Gly Thr Lys 
915 920 925 

att ttg caa tec agt gat att tec etc gat gaa gtc ttc cct tta ctg 2832 
He Leu Gin Ser Ser Asp He Ser Leu Asp Glu Val Phe Pro Leu Leu 
930 935 940 

gat tta caa gca tta ttt gtt ggt cag tgg cag ttt cgc aaa cct agg 2880 
Asp Leu Gin Ala Leu Phe Val Gly Gin Trp Gin Phe Arg Lys Pro Arg 
945 950 955 960 

gag caa tec agg gaa gaa tac gag caa ttc eta gcg gaa aaa gtt cat 2928 
Glu Gin Ser Arg Glu Glu Tyr Glu Gin Phe Leu Ala Glu Lys Val His 
965 970 975 

ccc att ttg get gag tgg aaa ggt aag gtc atg gca gaa aat tta etc 2976 
Pro He Leu Ala Glu Trp Lys Gly Lys Val Met Ala Glu Asn Leu Leu 
980 985 990 

cat cct acg gtg gtt tat ggt tat ttt ccc tgt caa tec cag ggc aat 3024 
His Pro Thr Val Val Tyr Gly Tyr Phe Pro Cys Gin Ser Gin Gly Asn 
995 1000 1005 

ace ttg tta att tat gac cca gaa ttg gtc age caa aat aat ggc caa 3072 
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Thr Leu Leu lie Tyr Asp Pro Glu Leu Val Ser Gin Asn Asn Gly Gin 
1010 1015 1020 

att ccc cca gac gca acg gcg ate gec aaa ttt gag ttt ccc xgg caa 3120 
He Pro Pro Asp Ala Thr Ala He Ala LyB Phe Glu Phe Pro Arg Gin 
1025 1030 1035 • 1040 

aaa tea ggg egg egg etc tgt att gcg gac ttt ttt get tea aaa gaa 3168 
Lys Ser Gly Arg Arg Leu Cys He Ala Asp Phe Phe Ala Ser LyB Glu 
1045 1050 ,1055 



teg ggg att act gat gtt ttt cct ttg caa gcg gtt aca gtg ggg gaa 3216 
Ser Gly He Thr Asp Val Phe Pro Leu Gin Ala Val Thr Val Gly Glu 
1060 1065 1070 

ate gcg acg gaa tat gca agg aaa ctt ttt get ggc gat aat tac acc 3264 
He Ala Thr Glu Tyr Ala Arg Lys Leu Phe Ala Gly Asp Asn Tyr Thr 
1075 1080 10B5 

gat tac etc tac ttc cac ggc atg gcg gtg cag atg gcg gaa get tta 3312 
Asp Tyr Leu Tyr Phe His Gly Met Ala Val Gin Met Ala Glu Ala Leu 
1090 1095 1100 

gcg gag tgg act cac caa egg ata cgt cag gaa ttg ggc ttt ggc cat 3360 
Ala Glu Trp Thr His Gin Arg He Arg Gin Glu Leu Gly Phe Gly His 
1105 1110 1115 1120 

tta gat cca gat aac ate cgt gat ctt etc cag caa cgt tac caa ggt 3408 
Leu Asp Pro Asp Asn He Arg Asp Leu Leu Gin Gin Arg Tyr Gin Gly 
1125 1130 " 1135 

tec cgc tac agt ttt ggt tat ccc get tgt ccc aac atg cag gat caa 3456 
Ser Arg Tyr Ser Phe Gly Tyr Pro Ala Cys Pro Asn Met Gin Asp Gin 
1140 1145 1150 

tac aca caa tta gaa ttg tta caa acc gaa cga att ggc ttg tat atg 3504 
Tyr Thr Gin Leu Glu Leu Leu Gin Thr Glu Arg He Gly Leu Tyr Met 
1155 1160 " 1165 

gat gaa agt gaa cag gtt tat cca gaa caa tec acc acg gcg att att 3552 
Asp Glu Ser Glu Gin Val Tyr Pro Glu Gin Ser Thr Thr Ala He He 
1170 1175 1180 

tec tat cat cct gcg get aaa tat ttc age get taa 3588 
Ser Tyr His Pro Ala Ala Lys Tyr Phe Ser Ala 
1185 1190 1195 



<210> 6 
<211> 1195 
<212> PRT 

<213> Synechocystis sp. 
<400> 6 

Met Lys Ser Ala Phe Leu Asp Arg He His Ser Pro Asp Arg Pro Val 
1 5 10 15 

Leu Val Phe Asp Gly Ala Met Gly Thr Asn Leu Gin Val Gin Asn Leu 
20 25 30 



Thr Ala Ala Asp Phe Gly Gly Ala Glu Tyr Glu Gly Cys Asn Glu Tyr 
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35 40 45 

Leu Val His Thr Lys Pro Glu Ala Val Ala Thr Val His Arg Ala Phe 
50 55 60 

Tyr Glu Ala Gly Ala Asp Val Val Glu Thr Asp Thr Phe Gly Gly Thr 
65 70 75 80 

Pro Leu Val Leu Ala Glu Tyr Asp Leu Ala Asp Gin Ser Tyr Tyr Leu 
85 90 95 

Asn Lys Ala Ala Ala Glu Leu Ala Lys Ala Val Ala Ala Glu Phe Ser 
100 105 no 

Thr Pro Glu Lys Pro Arg Phe Val Ala Gly Ser Met Gly Pro Gly Thr 
115 120 125 

Lys Leu Pro Thr Leu Gly His Val Asp Tyr Asp Ser Leu Lys Asp Ala 
130 135 140 

Tyr Val Val Gin Val Arg Gly Leu Tyr Asp Gly Gly Val Asp Leu Leu 
"5 150 155 160 

Leu Val Glu Thr Cys Gin Asp Val Leu Gin lie Lys Ala Ala Leu Asn 
165 170 175 

Ala He Glu Gin Val Phe Ala Glu Lys Gly Asp Arg Leu Pro Leu Met 
180 185 190 

Val Ser Val Thr Met Glu Thr Met Gly Thr Met Leu Val Gly Thr Glu 
195 200 205 

Met Ala Ala Ala Leu Ala He Leu Glu Pro Tyr Pro He Asp He Leu 
210 215 220 

Gly Leu Asn Cys Ala Thr Gly Pro Asp Leu Met Lys Glu His Val Lys 
225 230 235 240 

Tyr Leu Ser Glu His Ser Pro Phe Val Val Ser Cys He Pro Asn Ala 
245 250 255 

Gly Leu Pro Glu Asn Val Gly Gly Gin Ala Phe Tyr Arg Leu Thr Pro 
260 265 270 

Met Glu Leu Gin Met Ser Leu Met His Phe He Glu Asp Leu Gly Val 
275 280 285 

Gin Val He Gly Gly Cys Cys Gly Thr Arg Pro Asp His He Lys Ala 
290 295 300 

Leu Ala Asp He Ala Lys Asp Leu Gin Pro Lys Gin Arg Gin Pro His 
305 310 315 ~ 320 

Tyr Glu Pro Ser Ala Ala Ser He Tyr Ser Thr Gin Thr Tyr Ala Gin 
325 330 335 

Glu Asn Ser Phe Leu He He Gly Glu Arg Leu Asn Ala Ser Gly Ser 
340 345 ^ 350 

Lys Lys Cys Arg Asp Leu Leu Asn Ala Glu Asp Trp Asp Ser Leu Val 
355 360 " 365 
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Ser Leu Ala Lys Ser Gin Val Lys Glu Gly Ala Gin He Leu Asp Val 
370 375 380 

Asn Val Asp Tyr Val Gly Arg Asp Gly Val Arg Asp Met Lys -Glu Leu 
385 390 395 400 

Ala Ser Arg Leu Val Asn Asn Val Thr Leu Pro Leu Met Leu Asp Ser 
405 410 415 

Thr Glu Trp Gin Lys Met Glu Ala Gly . Leu Lys Val Ala Gly Gly Lys 
420 425 430 

Cys He Leu Asn Ser Thr Asn Tyr Glu Asp Gly Glu Glu Arg Phe Tyr 
435 440 445 

Lys Val Leu Glu He Ala Lys Glu Tyr Gly Ala Gly He Val He Gly 
450 455 460 

Thr He Asp Glu Asp Gly Met Gly Arg Thr Ala Asp Lys Lys Phe Glu 
465 470 475 480 

He Ala Lys Arg Ala Tyr Glu Ala Ala He Ala Phe Gly He Pro Ala 
485 490 495 

Thr Glu He Phe Phe Asp Pro Leu Ala Leu Pro He Ser Thr Gly He 
500 505 510 

Glu Glu Asp Arg Glu Asn Gly Lys Ala Thr Val Asp Ala He Arg Arg 
515 520 525 

He Arg Gin Glu Leu Pro Asp Cys His He Leu Leu Gly Val Ser Asn 
530 535 540 

Val Ser Phe Gly Leu Asn Pro Ala Ala Arg Gin Val Leu Asn Ser He 
545 550 555 560 

Phe Leu His Glu Cys Met Gin Val Gly Met Asp Ala Ala He Val Ser 
565 570 575 

Ala Asn Lys lie Leu Pro Leu Ala Lys He Asp Pro Glu Gin Gin Gin 
580 585 590 

Val Cys Leu Asp Leu He Tyr Asp Arg Arg Glu Phe Glu Gly Glu Arg 
595 600 605 

Cys Thr Tyr Asp Pro Leu Thr Lys Leu Thr Thr Leu Phe Glu Gly Lys 
610 615 620 

Thr Thr Lys Arg Asp Lys Ser Gly Asp Ala Asn Leu Pro Val Glu Glu 
625 630 635 640 

Arg Leu Lys Arg His He He Asp Gly Glu Arg Leu Gly Leu Glu Glu 
645 650 * 655 

Ala Leu Asn Glu Ala Leu Lys Leu Tyr Ala Pro Leu Asp He He Asn 
660 665 670 

He Tyr Leu Leu Asp Gly Met Lys Val Val Gly Glu Leu Phe Gly Ser 
675 680 685 

Gly Gin Met Gin Leu Pro Phe Val Leu Gin Ser Ala Gin Thr Met Lys 
690 695 700 
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Ala Ala Val Ala Phe Leu Glu Pro His Met Asp Lys Asp Asp Ser Ala 
705 710 715 * 720 

Asp Asn Ala Lys Gly Thr Phe Leu He Ala Thr Val Lys Gly Asp Val 
725 730 735 

His Asp He Gly Lys Asn Leu Val Asp He He Leu Ser Asn Asn Gly 
740 745 750 

Tyr Arg Val Val Asn Leu Gly He Lys Gin Pro Val Glu Asn He He 
755 760 765 

Glu Ala Tyr Lys Lys His Arg Pro Asp Cys He Ala Met Ser Gly Leu 
770 775 780 

Leu Val Lys Ser Thr Ala Phe Met Lys Glu Asn Leu Glu Val Phe Asn 
785 790 795 800 

Gin Glu Gly He Thr Val Pro Val He Leu Gly Gly Ala Ala Leu Thr 
80S 810 815 

Pro Lys Phe Val His Gin Asp Cys Gin Asn Thr Tyr Lys Gly Gin Val 
820 825 830 

He Tyr Gly Lys Asp Ala Phe Ala Asp Leu His Phe Met Asp Lys Leu 
835 840 845 

Met Pro Ala Lys Asn Ser His Asn Trp Asp Asp Phe Gin Gly Phe Leu 
650 655 860 

Gly Glu Tyr Ala Thr Glu Asn Gly His Asn Val Thr Thr Asp Asp Gly 
865 870 875 880 

Ala Lys Thr Asn Phe Gly He Glu Glu Glu Lys Leu He Asp Ala Ser 
885 890 895 

Glu Gin Ser Arg Glu Pro Glu Val He Asp Thr Val Arg Ser Glu Ala 
900 905 9io 

Val Asp Pro Asp Leu Glu Arg Pro Val Pro Pro Phe Trp Gly Thr Lys 
915 920 925 

He Leu Gin Ser Ser Asp He Ser Leu Asp Glu Val Phe Pro Leu Leu 
930 935 940 

Asp Leu Gin Ala Leu Phe Val Gly Gin Trp Gin Phe Arg Lys Pro Arg 
545 950 955 960 

Glu Gin Ser Arg Glu Glu Tyr Glu Gin Phe Leu Ala Glu Lys Val His 
965 970 975 

Pro He Leu Ala Glu Trp Lys Gly Lys Val Met Ala Glu Asn Leu Leu 
980 985 990 

His Pro Thr Val Val Tyr Gly Tyr Phe Pro Cys Gin Ser Gin Gly Asn 
995 1000 1005 

Thr Leu Leu He Tyr Asp Pro Glu Leu Val Ser Gin Asn Asn Gly Gin 
1010 1015 1020 

He Pro Pro Asp Ala Thr Ala He Ala Lys Phe Glu Phe Pro Arg Gin 
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1025 1030 1035 1040 

Lye Ser Gly Arg Arg Leu Cys He Ala Asp Phe Phe Ala Ser Lys Glu 
1045 1050 -1055 

Ser Gly He Thr Asp Val Phe Pro Leu Gin Ala Val Thr Val Gly Glu 
1060 1065 1070 

He Ala Thr Glu Tyr Ala Arg Lys Leu Phe Ala Gly Asp Asn Tyr Thr 
1075 10B0 1085 

Asp Tyr Leu Tyr Phe His Gly Met Ala Val Gin Met Ala Glu Ala Leu 
1090 1095 1100 

Ala Glu Trp Thr His Gin Arg He Arg Gin Glu Leu Gly Phe Gly His 
H05 1110 1115 * 1120 

Leu Asp Pro Asp Asn He Arg Asp Leu Leu Gin Gin Arg Tyr Gin Gly 
1125 1130 1135 

Ser Arg Tyr Ser Phe Gly Tyr Pro Ala Cys Pro Asn Met Gin Asp Gin 
1140 1145 1150 

Tyr Thr Gin Leu Glu Leu Leu Gin Thr Glu Arg He Gly Leu Tyr Met 
1155 1160 ~ H65 

Asp Glu Ser Glu Gin Val Tyr Pro Glu Gin Ser Thr Thr Ala He He 
1170 1175 HBO 

Ser Tyr HiB Pro Ala Ala Lys Tyr Phe Ser Ala 
1185 1190 H95 



<210> 7 
<211> 3561 
<212> DNA 

<213> Prochlorococcus marinus 

<220> 

<221> CDS 

<222> (1) . . (3558) 

<223> RCK00830 



<400> 7 

atg gtt tea ttt aga aat tat tta aat aga gat gat aaa cca att att 4 8 

Met Val Ser Phe Arg Asn Tyr Leu Asn Arg Asp Asp Lys Pro He He 
1 5 10 is 

att ttc gat ggt ggg aca ggt act tct ttt caa aat tta aat tta tea 96 
He Phe Asp Gly Gly Thr Gly Thr Ser Phe Gin Asn Leu Asn Leu Ser 
20 25 30 

tea cat gat ttt ggt gga gat gat tta gag ggt tgc aat gaa aac tta 144 
Ser His Asp Phe Gly Gly Asp Asp Leu Glu Gly Cys Asn Glu Asn Leu 
35 40 * 45 

gtt eta tec tct cct aat act gtt gaa caa gta cat aat tea ttt ctt 192 
Val Leu Ser Ser Pro Asn Thr Val Glu Gin Val His Asn Ser Phe Leu 
50 55 60 

gaa gca ggt tgt cat gta att gaa ace aat aca ttt ggt get tea tct 240 
Glu Ala Gly Cys His Val He Glu Thr Asn Thr Phe Gly Ala Ser Ser 
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65 70 75 80 

att gtt tta gac gaa tat agt att tct aat aaa get tat gaa ate aat 288 
He Val Leu Asp Glu Tyr Ser He Ser Asn Lys Ala Tyr Glu He Asn 
85 90 95 

aaa aaa gca get cag ata get aaa aaa tgt gca aat tta ttt tea tct 336 
Lye Lys Ala Ala Gin He Ala Lys Lys Cys Ala Asn Leu Phe Ser Ser 
100 105 no 

att aat act cct aga ttt gtc get gga tea att ggg cca act aca aaa 384 
He Asn Thr Pro Arg Phe Val Ala Gly Ser He Gly Pro Thr Thr Lys 
115 120 125 

tta cca aca tta ggt cat att agt ttt gat aag ctt aaa gat tea tat 432 
Leu Pro Thr Leu Gly His He Ser Phe Asp Lys Leu Lys Asp Ser Tyr 
130 135 140 

gaa gaa caa ata aat ggt eta att gac gga ggt att gac ctt eta ttg 480 
Glu Glu Gin He Asn Gly Leu He Asp Gly Gly He Asp Leu Leu Leu 
145 150 155 160 

att gaa aca tgc caa gat gtt tta caa ata aaa tea gca tta tct get 528 
He Glu Thr Cys Gin Asp Val Leu Gin He Lys Ser Ala Leu Ser Ala 
165 170 175 

tct caa gaa gtt att aaa aac agg aat att gaa tta cca ata atg ata 576 
Ser Gin Glu Val He Lys Asn Arg Asn He Glu Leu Pro He Met He 
180 185 190 

tec ata act atg gaa ace aca gga acg atg ctt gtc ggg tea gat ata 624 
Ser He Thr Met Glu Thr Thr Gly Thr Met Leu Val Gly Ser Asp He 
195 200 205 



get tct gca tta aca ata tta gag cca tac aat att gat att ctg gga 
Ala Ser Ala Leu Thr He Leu Glu Pro Tyr Asn lie Asp He Leu Gly 



672 



210 215 220 



ctg aat tgt gca act ggt cca gtt caa atg aaa gaa cat att aag tat 720 
Leu Asn Cys Ala Thr Gly Pro Val Gin Met Lys Glu His He Lys Tyr 
225 230 235 240 

tta get gaa aat tea cct ttt gca att agt tgt ata cct aat gca gga 768 
Leu Ala Glu Asn Ser Pro Phe Ala He Ser Cys He Pro Asn Ala Gly 
245 250 255 

tta cct gaa aat ata gga ggt gtt get cac tat aaa tta act cca ttg 816 
Leu Pro Glu Asn He Gly Gly Val Ala His Tyr Lys Leu Thr Pro Leu 
260 265 270 

gag ttg aaa atg cag tta atg aac ttt att tat gat ttt aac gta caa 864 
Glu Leu Lys Met Gin Leu Met Asn Phe He Tyr Asp Phe Asn Val Gin 
275 280 285 

ctt att ggc gga tgt tgt ggt act act cct gaa cat ate aag cat tta 912 
Leu He Gly Gly Cys Cys Gly Thr Thr Pro Glu His He Lys His Leu 
290 295 300 

tea tea ate att gag gaa ata gtt gat aaa aaa ata aat aaa aga ctt 960 
Ser Ser He He Glu Glu He Val Asp Lys Lys He Asn Lys Arg Leu 
305 310 315 320 
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cct act gta aaa aca aat ttt gtt cct tea gca get tct ata tat aac 1008 
Pro Thr Val Lys Thr Asn Phe Val Pro Ser Ala Ala Ser lie Tyr Asn 
325 330 335 

gca gtt cca tat aaa caa gat aac tea ata tta ata gtt gga gaa cgt 1056 
Ala Val Pro Tyr Lys Gin Asp Asn Ser lie Leu He Val Gly Glu Arg 
340 345 350 

tta aat get agt gga tea aaa aaa gta agg gaa tta eta aat gaa gat 1104 
Leu Asn Ala Ser Gly Ser Lys Lys Val Arg Glu Leu Leu As.n Glu Asp 
355 360 365 

gat tgg gac ggc ctg eta tea att get aaa caa cag caa aaa gaa aat 1152 
Asp Trp Asp Gly Leu Leu Ser He Ala Lys Gin Gin Gin Lys Glu Asn 
370 375 380 

get cac ata eta gat gtc aat gtt gat tat gta gga aga gat gga gtt 1200 
Ala His lie Leu Asp Val Asn Val Asp Tyr Val Gly Arg Asp Gly Val 
385 390 395 "* 400 

aaa gat atg aaa gaa att ace tea aga tta gtt aca aat ata aat ctt 1248 
LyB Asp Met Lys Glu He Thr Ser Arg Leu Val Thr Asn He Asn Leu 
405 410 415 

cca tta atg ata gat tea aca gaa gca gat aaa atg gaa agt gga tta 1296 
Pro Leu Met He Asp Ser Thr Glu Ala Asp Lys Met Glu Ser Gly Leu 
420 425 430 

aag act gta gga gga aaa tgc att ata aat tea aca aac tac gaa gat 1344 
Lys Thr Val Gly Gly Lys Cys He He Asn Ser Thr Asn Tyr Glu Asp 
435 440 445 

gga gat gac aga ttt aat cag gtc tta aga ctt gca tta gat tat ggt 1392 
Gly Asp Asp Arg Phe Asn Gin Val Leu Arg Leu Ala Leu Asp Tyr Gly 
450 455 460 

get gga ata gta att gga act att gat gaa gat gga atg gca aga aca 1440 
Ala Gly He Val He Gly Thr He Asp Glu Asp Gly Met Ala Arg Thr 
465 470 475 480 

tea cag aaa aaa tat gac att gca aaa aga gca tta att aaa act aga 1488 
Ser Gin Lys Lys Tyr Asp He Ala Lys Arg Ala Leu He Lys Thr Arg 
485 490 495 

tea agt ggc etc get gat tat gag ata ttt ttt gat cct eta gca ttg 1536 
Ser Ser Gly Leu Ala Asp Tyr Glu He Phe Phe Asp Pro Leu Ala Leu 
500 505 510 

cca ata tct act gga att gaa gaa gat aga tta aat get aaa gca act 1584 
Pro He Ser Thr Gly He Glu Glu Asp Arg Leu Asn Ala Lys Ala Thr 
515 520 525 

att gaa get ata tea aaa ata aga aaa age ttt cca gat att cat att 1632 
He Glu Ala He Ser Lys He Arg Lys Ser Phe Pro Asp He His He 
530 535 540 

att tta ggg ata tct aat att agt ttc ggg ctt tea cca tta tea aga 1680 
He Leu Gly lie Ser Asn He Ser Phe Gly Leu Ser Pro Leu Ser Arg 
545 550 555 560 

att aat eta aat tea ata ttt etc gat gaa tgt ata aag gca gga tta 1728 
He Asn Leu Asn Ser He Phe Leu Asp Glu Cys He Lys Ala Gly Leu 



f 
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565 570 575 

gat tea gcg att att gca cca aat aaa ata ttg cct ctt tea aaa ata 1776 
Asp Ser Ala He He Ala Pro Asn Lys He Leu Pro Leu Ser Lys He 
580 585 590 

tct gcg gaa aca aaa aaa tta tgt tta gat tta att tat gac aga aga 1824 
Ser Ala Glu Thr Lys Lye Leu Cys Leu Asp Leu He Tyr Asp Arg Arg 
595 600 605 

aat ttc gaa aat gaa ata tgt ata tat gat cca tta gtt gaa eta aca 1872 
Asn Phe Glu Asn Glu lie Cys He Tyr Asp Pro Leu Val Glu Leu Thr 
610 615 620 



aaa gca ttc caa gat ata aca ate agt gac ttt aaa aaa gga tct act 
Lys Ala Phe Gin Asp He Thr He Ser Asp Phe Lys Lys Gly Ser Thr 
625 630 635 640 



9 a t ggg gaa aaa ata ggt tta gaa gaa caa tta aat aat gcg ctt aaa 
Asp Gly Glu Lys He Gly Leu Glu Glu Gin Leu Asn Asn Ala Leu Lys 
660 665 670 

aag tac aaa cca ctt gaa ata att aat act tat tta tta gat gga atg 
Lys Tyr Lys Pro Leu Glu He He Asn Thr Tyr Leu Leu Asp Gly Met 
675 680 685 



cct cat atg gaa aca gta gat gaa aaa ata tct aac gga aaa tta eta 
Pro His Met Glu Thr Val Asp Glu Lys He Ser Asn Gly Lys Leu Leu 
725 730 735 



gac tgt att get atg agt ggt tta ctt gtt aaa tct aca gca ttt atg 
Asp Cys He Ala Met Ser Gly Leu Leu Val Lys Ser Thr Ala Phe Met 
785 790 795 ~ 800 



1920 



tea aac aaa aac etc acc tta gaa gaa aaa ctt aaa aac cat att gta 1968 
Ser Asn Lys Asn Leu Thr Leu Glu Glu Lys Leu Lys Asn His He Val 
645 650 655 



2016 



2064 



aaa gta gtc ggt gaa eta ttt gga tec ggc caa atg caa tta cct ttt 2112 
Lys Val Val Gly Glu Leu Phe Gly Ser Gly Gin Met Gin Leu Pro Phe 
690 695 700 

gta ttg caa tea gcg gaa aca atg aaa ttt get gtt tea gtg ctt gaa 2160 
Val Leu Gin Ser Ala Glu Thr Met Lys Phe Ala Val Ser Val Leu Glu 
7 °5 710 715 720 



2208 



ata gca act gtt aaa gga gat gtt cat gat ata ggt aaa aat tta gtt 2256 

He Ala Thr Val Lys Gly Asp Val His Asp He Gly LyB Asn Leu Val 
740 745 750 

gat ata att etc tea aat aat ggt ttt gat gta ate aac ctt gga att 2304 

Asp He He Leu Ser Asn Asn Gly Phe Asp Val He Asn Leu Gly He 

755 760 765 

aag caa gat gtt tea gcg att att gat gca caa aaa aaa cat aaa gca 2352 

Lys Gin Asp Val Ser Ala He He Asp Ala Gin Lys Lys His Lys Ala 
770 775 780 



2400 



aag gat aat tta gaa gca ttt aac aat get gaa att aat gtt cca gtt 2448 
Lys Asp Asn Leu Glu Ala Phe Asn Asn Ala Glu He Asn Val Pro Val 
805 810 815 
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att ctt gga ggt gca gca tta act cca aaa ttt gtg aat gaa gat tgt 2496 
He Leu Gly Gly Ala Ala Leu Thr Pro Lye Phe Val Asn Glu Asp Cys 
820 825 830 

agt cag ata tat aaa ggt aaa att ttg tat ggg aaa gat get ttt aca 2544 
Ser Gin He Tyr Lye Gly Lys He Leu Tyr Gly Lys Asp Ala Phe Thr 
835 840 845 

gat tta caa ttt atg aat gac tat atg gat agt aaa aag aag ggc aat 2592 
Asp Leu Gin Phe Met Asn Asp Tyr Met Asp Ser Lys Lys Lys Gly Asn 
650 855 860 

tgg tct aat gaa aat ggt ttt act aat act gat gat att caa att aaa 2640 
Trp Ser Asn Glu Asn Gly Phe Thr Asn Thr Asp Asp He Gin He Lys - 
865 870 875 860 

tta get tec cca agg tct tec get aaa gat aaa aat tta aat aaa aat 2686 
Leu Ala Ser Pro Arg Ser Ser Ala Lys Asp Lys Asn Leu Asn Lys Asn 
885 890 895 

ttt gaa aaa acc aaa agt att caa tta att gag aat ttt aat aga tct 2736 
Phe Glu Lys Thr Lys Ser He Gin Leu He Glu Asn Phe Asn Arg Ser 
900 905 910 

aat ttt gta gag gaa gag gaa cct ata aag get cca ttt ttg gga act 2784 
Asn Phe Val Glu Glu Glu Glu Pro He Lys Ala Pro Phe Leu Gly Thr 
915 920 925 

aga gtt ctt caa gat att gaa ata gac ttt gac aaa eta att ttt tat 2832 
Arg Val Leu Gin Asp He Glu He Asp Phe Asp Lys Leu He Phe Tyr 
930 935 940 

eta gat aaa aaa gca tta ttt agt ggt caa tgg caa att aaa aaa aat 2880 
Leu Asp Lys Lys Ala Leu Phe Ser Gly Gin Trp Gin He Lys Lys Asn 
945 950 955 960 

aaa ggt caa tea gta gaa gaa tac aat aat tat tta gat tea tat gca 2928 
Lys Gly Gin Ser Val Glu Glu Tyr Asn Asn Tyr Leu Asp Ser Tyr Ala 
965 970 975 

aat cca tta ctt gaa aaa tgg att aat att att tta gat aaa ggc tta 2976 
Asn Pro Leu Leu Glu Lys Trp He Asn He He Leu Asp Lys Gly Leu 
980 985 990 

att tea cca aaa gta gtc tat ggc tac ttc cgt tgc ggg agg aat gat 3024 
He Ser Pro Lys Val Val Tyr Gly Tyr Phe Arg Cys Gly Arg Asn Asp 
995 1000 1005 

aat agt att tat etc ttt gat aat gta tea aat aaa aga att tct gaa 3072 
Asn Ser He Tyr Leu Phe Asp Asn Val Ser Asn Lys Arg He Ser Glu 
1010 1015 1020 

ttt aac ttt cct aga caa aaa teg gga aat aat ctt tgt att gca gat 3120 
Phe Asn Phe Pro Arg Gin Lys Ser Gly Asn Asn Leu Cys He Ala Asp 
1025 1030 1035 1040 

ttt tac tgt gat ctt aaa aat aat gat cca gta gat ata ttt cca atg 3168 
Phe Tyr Cys Asp Leu Lye Asn Asn Asp Pro Val Asp He Phe Pro Met 
1045 1050 1055 

caa gca gta aca atg ggg gaa ata get age gaa tat tec caa gaa tta 3216 
Gin Ala Val Thr Met Gly Glu He Ala Ser Glu Tyr Ser Gin Glu Leu 
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1060 



1065 



1070 



ttt aaa get gat aaa tat agt gat tat tta ata ttt cat ggt tta acc 3264 
Phe Lys Ala Asp Lye Tyr Ser Aep Tyr Leu He Phe His Gly Leu Thr 
1075 1060 1085 



gtt caa tta gca gaa get ctt gca gaa tat gtt cat tea ata gta aga 
Val Gin Leu Ala Glu Ala Leu Ala Glu Tyr Val His Ser He Val Arg 
1090 1095 1100 



3312 



att gaa tgc gga ttt aaa tea tat gag cca aac aat aac cgt gat ata 
He Glu Cys Gly Phe Lys Ser Tyr Glu Pro Asn Asn Asn Arg Asp He 
1105 1110 1H5 1120 



3360 



tta get caa aaa tat aga gga get aga tac tea ttt ggt tat cca get 3408 
Leu Ala Gin Lys Tyr Arg Gly Ala Arg Tyr Ser Phe Gly Tyr Pro Ala 
1125 1130 1135 

tgt cct aaa gtt tct gat tea aat ata cag tta tea tta ttg gat aca 3456 
Cys Pro Lys Val Ser Asp Ser Asn He Gin Leu Ser Leu Leu Asp Thr 
1140 1145 1150 

aaa agg att aat tta aca atg gat gaa tea gag caa tta cat cct gaa 3504 
Lys Arg He Asn Leu Thr Met Asp Glu Ser Glu Gin Leu His Pro Glu 
1155 1160 1165 

caa agt act act get ata att tea ctt cat tea aaa gca aaa tat ttt 3552 
Gin Ser Thr Thr Ala He He Ser Leu His Ser Lys Ala Lys Tyr Phe 
1170 1175 1180 



agt gee taa 
Ser Ala 
1185 



3561 



<210> 8 
<211> 1186 
<212> PRT 

<213> Prochlorococcus marinus 
<400> 8 

Met Val Ser Phe Arg Asn Tyr Leu Asn Arg Asp Asp Lys Pro He He 
1 5 10 ' 15 

He Phe Asp Gly Gly Thr Gly Thr Ser Phe Gin Asn Leu Asn Leu Ser 
20 25 30 

Ser His Asp Phe Gly Gly Asp Asp Leu Glu Gly Cys Asn Glu Asn Leu 
35 40 45 

Val Leu Ser Ser Pro Asn Thr Val Glu Gin Val His Asn Ser Phe Leu 
50 55 60 

Glu Ala Gly Cys His Val He Glu Thr Asn Thr Phe Gly Ala Ser Ser 
65 70 75 80 

lie Val Leu Asp Glu Tyr Ser He Ser Asn Lys Ala Tyr Glu He Asn 
85 90 95 



Lys Lys Ala Ala Gin He Ala Lys Lye Cys Ala Asn Leu Phe Ser Ser 
100 105 no 
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He Asn Thr Pro Arg Phe Val Ala Gly Ser He Gly Pro Thr Thr Lys 
115 120 125 

Leu Pro Thr Leu Gly His He Ser Phe Asp Lys Leu Lys Asp, Ser Tyr 
130 135 140 

Glu Glu Gin He Asn Gly Leu He Asp Gly Gly He Asp Leu Leu Leu 
145 150 155 160 

He Glu Thr Cys Gin Asp Val Leu Gin. lie Lys Ser Ala Leu Ser Ala 
165 170 " 175 

Ser Gin Glu Val He Lys Asn Arg Asn He Glu Leu Pro He Met He 
180 185 190 

Ser He Thr Met Glu Thr Thr Gly Thr Met Leu Val Gly Ser Asp He 
195 200 205 

Ala Ser Ala Leu Thr He Leu Glu Pro Tyr Asn He Asp He Leu Gly 
210 215 220 

Leu Asn Cys Ala Thr Gly Pro Val Gin Met Lys Glu His He Lys Tyr 
225 230 235 240 

Leu Ala Glu Asn Ser Pro Phe Ala He Ser Cys He Pro Asn Ala Gly 
245 250 255 

Leu Pro Glu Asn He Gly Gly Val Ala His Tyr Lys Leu Thr Pro Leu 
260 265 270 

Glu Leu Lys Met Gin Leu Met Asn Phe He Tyr Asp Phe Asn Val Gin 
275 280 285 

Leu He Gly Gly Cys Cys Gly Thr Thr Pro Glu His He Lys His Leu 
290 295 300 

Ser Ser He He Glu Glu He Val Asp Lys Lys He Asn Lys Arg Leu 
305 310 315 320 

Pro Thr Val Lys Thr Asn Phe Val Pro Ser Ala Ala Ser He Tyr Asn 
325 330 335 

Ala Val Pro Tyr Lys Gin Asp Asn Ser He Leu He Val Gly Glu Arg 
340 345 350 

Leu Asn Ala Ser Gly Ser Lys Lys Val Arg Glu Leu Leu Asn Glu Asp 
355 360 365 

Asp Trp Asp Gly Leu Leu Ser He Ala Lys Gin Gin Gin Lys Glu Asn 
370 375 380 

Ala His He Leu Asp Val Asn Val Asp Tyr Val Gly Arg Asp Gly Val 
385 390 395 ~ 400 

Lys Asp Met Lys Glu He Thr Ser Arg Leu Val Thr Asn He Asn Leu 
405 410 415 

Pro Leu Met He Asp Ser Thr Glu Ala Asp Lys Met Glu Ser Gly Leu 
420 425 430 

Lys Thr Val Gly Gly Lys Cys He He Asn Ser Thr Asn Tyr Glu Asp 
435 440 445 



I 
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Gly Asp Asp Arg Phe Asn Gin Val Leu Arg Leu Ala Leu Asp Tyr Gly 
450 455 460 

Ala Gly He Val He Gly Thr He Asp Glu Asp Gly Met Ala Arg Thr 
465 470 475 480 

Ser Gin Lys Lys Tyr Asp He Ala Lys Arg Ala Leu He Lys Thr Arg 
485 490 495 

Ser Ser Gly Leu Ala Asp Tyr Glu He Phe Phe Asp Pro Leu Ala Leu 
500 505 510 

Pro He Ser Thr Gly He Glu Glu Asp Arg Leu Asn Ala Lys Ala Thr 
515 520 525 

He Glu Ala He Ser Lys He Arg Lys Ser Phe Pro Asp He His He 
530 535 540 

He Leu Gly He Ser Asn He Ser Phe Gly Leu Ser Pro Leu Ser Arg 
545 550 555 560 

He Asn Leu Asn Ser He Phe Leu Asp Glu Cys He Lys Ala Gly Leu 
565 570 575 

Asp Ser Ala He He Ala Pro Asn Lys He Leu Pro Leu Ser Lys He 
580 585 590 

Ser Ala Glu Thr Lys Lys Leu Cys Leu Asp Leu He Tyr Asp Arg Arg 
595 600 605 

Asn Phe Glu Asn Glu He Cys He Tyr Asp Pro Leu Val Glu Leu Thr 
610 615 620 

Lys Ala Phe Gin Asp He Thr He Ser Asp Phe Lys Lys Gly Ser Thr 
"5 630 635 640 

Ser Asn Lys Asn Leu Thr Leu Glu Glu Lys Leu Lys Asn His He Val 
645 650 655 

Asp Gly Glu Lys He Gly Leu Glu Glu Gin Leu Asn Asn Ala Leu Lys 
660 665 670 

Lys Tyr Lys Pro Leu Glu He He Asn Thr Tyr Leu Leu Asp Gly Met 
675 680 685 

Lys Val Val Gly Glu Leu Phe Gly Ser Gly Gin Met Gin Leu Pro Phe 
690 695 700 

Val Leu Gin Ser Ala Glu Thr Met Lys Phe Ala Val Ser Val Leu Glu 
705 710 715 720 

Pro His Met Glu Thr Val Asp Glu Lys He Ser Asn Gly Lys Leu Leu 
725 730 735 

He Ala Thr Val Lys Gly Asp Val His Asp lie Gly Lys Asn Leu Val 
740 745 750 

Asp He He Leu Ser Asn Asn Gly Phe Asp Val He Asn Leu Gly He 
755 760 765 

Lys Gin Asp Val Ser Ala He lie Asp Ala Gin Lys Lys His Lys Ala 
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770 775 780 

Asp Cys He Ala Met Ser Gly Leu Leu Val Lys Ser Thr Ala Phe Met 
785 790 795 , 800 

Lye Asp Asn Leu Glu Ala Phe Asn Asn Ala Glu He Asn Val Pro Val 
805 810 815 

He Leu Gly Gly Ala Ala Leu Thr Pro Lys Phe Val Asn Glu Asp Cys 
820 825 830 

Ser Gin He Tyr Lys Gly Lys He Leu Tyr Gly Lys Asp Ala Phe Thr 
835 840 * 845 

Asp Leu Gin Phe Met Asn Asp Tyr Met Asp Ser Lys Lys Lys Gly Asn 
850 855 860 

Trp Ser Asn Glu Asn Gly Phe Thr Asn Thr Asp Asp He Gin He Lys 
8 « 870 875 880 

Leu Ala Ser Pro Arg Ser Ser Ala Lys Asp Lys Asn Leu Asn Lys Asn 
885 890 895 

Phe Glu Lys Thr Lys Ser He Gin Leu He Glu Asn Phe Asn Arg Ser 
900 905 910 

Asn Phe Val Glu Glu Glu Glu Pro He Lys Ala Pro Phe Leu Gly Thr 
915 920 925 

Arg Val Leu Gin Asp He Glu He Asp Phe Asp Lys Leu He Phe Tyr 
930 935 940 

Leu Asp Lys Lys Ala Leu Phe Ser Gly Gin Trp Gin He Lys Lys Asn 
945 950 955 960 

Lys Gly Gin Ser Val Glu Glu Tyr Asn Asn Tyr Leu Asp Ser Tyr Ala 
965 970 975 

Asn Pro Leu Leu Glu Lys Trp He Asn He He Leu Asp Lys Gly Leu 
980 965 990 

He Ser Pro Lys Val Val Tyr Gly Tyr Phe Arg Cys Gly Arg Asn Asp 
995 1000 1005 

Asn Ser He Tyr Leu Phe Asp Asn Val Ser Asn Lys Arg He Ser Glu 
1010 1015 1020 

Phe Asn Phe Pro Arg Gin Lys Ser Gly Asn Asn Leu Cys He Ala Asp 
1025 1030 1035 1040 

Phe Tyr Cys Asp Leu Lys Asn Asn Asp Pro Val Asp He Phe Pro Met 
1045 1050 1055 

Gin Ala Val Thr Met Gly Glu He Ala Ser Glu Tyr Ser Gin Glu Leu 
1060 1065 1070 

Phe Lys Ala Asp Lys Tyr Ser Asp Tyr Leu He Phe His Gly Leu Thr 
1075 1080 1085 

Val Gin Leu Ala Glu Ala Leu Ala Glu Tyr Val His Ser He Val Arg 
1090 1095 1100 



( 
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He Glu Cys Gly Phe Lys Ser Tyr Glu Pro Asn Aen Asn Arg Asp He 
1105 1H0 ills 1120 

Leu Ala Gin Lys Tyr Arg Gly Ala Arg Tyr Ser Phe Gly Tyr Pro Ala 
H25 H30 H35 

Cys Pro Lye Val Ser Asp Ser Asn He Gin Leu Ser Leu Leu Asp Thr 
1140 H45 1150 

Lys Arg He Asn Leu Thr Met Asp Glu Ser Glu Gin Leu His Pro Glu 
H55 H60 H65 

Gin Ser Thr Thr Ala He He Ser Leu His Ser Lys Ala Lys Tyr Phe 
H70 H75 1180 

Ser Ala 
1185 



<210> 9 
<211> 3048 
<212> DNA 

<213> Thermus thermophilus 

<220> 

<221> CDS 

<222> (1).. (3045) 

<223> RTT00266 

<400> 9 

atg egg gec tac aag gag gcg gca egg ggg ctt ctt aag ggc ggg gtg 4 8 
Met Arg Ala Tyr Lys Glu Ala Ala Arg Gly Leu Leu Lys Gly Gly Val 
1 5 10 15 

gac etc ate etc ttg gag ace gee cag gac ate etc cag gtg cgc tgc 96 
Asp Leu He Leu Leu Glu Thr Ala Gin Asp He Leu Gin Val Arg Cys 
20 25 30 

gee gtc ttg gcg gtg egg gag gee atg gee gag gtg ggc egg gag gtg 144 
Ala Val Leu Ala Val Arg Glu Ala Met Ala Glu Val Gly Arg Glu Val 
35 40 45 

ccc etc cag gtc cag gtg ace ttt gag gee acg ggg acg atg etc gtg 192 
Pro Leu Gin Val Gin Val Thr Phe Glu Ala Thr Gly Thr Met Leu Val 
50 55 60 

ggc acg gac gag cag gcg gec ctg gee get ctg gag age etc ccc gtg 240 
Gly Thr Asp Glu Gin Ala Ala Leu Ala Ala Leu Glu Ser Leu Pro Val 
65 70 75 80 

gac gtg gtg ggg atg aac tgc gee acg ggc ccc gac etc atg gac age 288 
Asp Val Val Gly Met Asn Cys Ala Thr Gly Pro Asp Leu Met Asp Ser 
85 90 95 



aag gtg cgc tac ttc gec gag cac age acc cgc ttc gtc tec tgc etc 
Lys Val Arg Tyr Phe Ala Glu His Ser Thr Arg Phe Val Ser Cys Leu 
100 105 no 



336 



ccg aac gcg ggc ctg ccc egg aac gag ggg ggg agg gtg gtc tac gac 364 
Pro Asn Ala Gly Leu Pro Arg Asn Glu Gly Gly Arg Val Val Tyr Asp 
115 120 125 
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etc acc ccc gag gag etc gee aag tgg cac etc aag ttc gtg gee gag 432 

Leu Thr Pro Glu Glu Leu Ala Lys Trp His Leu Lys Phe Val Ala Glu 

130 135 140 

tac ggg gtg aac gec gtg ggg gga tgc tgc ggc acg ggg ccc gag cac 480 
Tyr Gly Val Asn Ala Val Gly Gly Cys Cys Gly Thr Gly Pro Glu His 
145 150 155 160 

ata agg aag gtg gec gag gcg gtg aag ggg etc gec ccg aag cca agg 528 
lie Arg Lys Val Ala Glu Ala Val Lys Gly Leu Ala Pro Lys Pro Arg 
165 170 ' 175 

ccc gaa age ttc cct ccc cag gtg gee tec ttg tac cag gcg gtg tec 576 
Pro Glu Ser Phe Pro Pro Gin Val Ala Ser Leu Tyr Gin Ala Val Ser 
180 185 190 

etc aag cag gag gcg age ctt ttc etc gtg ggg gag agg etc aac gee 624 
Leu Lys Gin Glu Ala Ser Leu Phe Leu Val Gly Glu Arg Leu Asn Ala 
195 200 205 

acg ggg age aag cgc ttc egg gag atg etc ttc gcg aga gac etc gag 672 
Thr Gly Ser Lys Arg Phe Arg Glu Met Leu Phe Ala Arg Asp Leu Glu 
210 215 220 

ggc ate etc gee etc gec egg gag cag gtg gag gag ggg gee cac gee 720 
Gly lie Leu Ala Leu Ala Arg Glu Gin Val Glu Glu Gly Ala .His Ala 
225 230 235 240 

ctg gac etc tec gtg gee tgg acg ggg egg gac gag ctt gag gac etc 768 
Leu Asp Leu Ser Val Ala Trp Thr Gly Arg Asp Glu Leu Glu Asp Leu 
245 250 255 

egg tgg etc ctt ccc cat etc gee acc gee ctt acc gtc ccc gtc atg 816 
Arg Trp Leu Leu Pro His Leu Ala Thr Ala Leu Thr Val Pro Val Met 
260 265 270 

gtg gac tec acc tec cct gag gee atg gag etc gee etc aaa tac etc 864 
Val Asp Ser Thr Ser Pro Glu Ala Met Glu Leu Ala Leu Lys Tyr Leu 
275 280 285 

ccg ggc egg gtc etc ctg aac tec gee aac etc gag gat ggc tta gag 912 
Pro Gly Arg Val Leu Leu Asn Ser Ala Asn Leu Glu Asp Gly Leu Glu 
290 295 300 

cgc ttt gac egg gtg gec tec ctg gee aag gee cac ggg gcg gec etc 960 
Arg Phe Asp Arg Val Ala Ser Leu Ala Lys Ala His Gly Ala Ala Leu 
305 310 315 320 

gtg gtc etc gec att gac gag aag ggg atg gee aag acc egg gag gag 1008 
Val Val Leu Ala lie Asp Glu Lys Gly Met Ala Lys Thr Arg Glu Glu 
325 330 335 

aag gtg egg gtg gec ctg agg atg tac gag cgc etc acg gag cac cac 1056 
Lys Val Arg Val Ala Leu Arg Met Tyr Glu Arg Leu Thr Glu His His 
340 345 350 

ggc etc cgc ccc gag gac etc etc ttt gac etc ctt acc ttc ccc ate 1104 
Gly Leu Arg Pro Glu Asp Leu Leu Phe Asp Leu Leu Thr Phe Pro He 
355 360 365 



acc caa ggg gac gag gag age cgc cct ctg gee aag gag acc etc etc 
Thr Gin Gly Asp Glu Glu Ser Arg Pro Leu Ala Lys Glu Thr Leu Leu 



1152 
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370 375 380 

gcc ata gag gag eta egg gag agg ctt ccc ggg gtg ggc ttc gtc ctt 1200 
Ala lie Glu Glu Leu Arg Glu Arg Leu Pro Gly Val Gly Phe Val Leu 
385 390 395 400 

egg gtc tec aac gtc tec ttc ggg etc aag ccc egg gcg agg cgc gtc 1248 
Arg Val Ser Asn Val Ser Phe Gly Leu LyB Pro Arg Ala Arg Arg Val 
405 410 415 

ctg aac tec gtc ttc ctg gac gag gcg agg aaa egg ggc etc acc gcg 1296 
Leu Asn Ser Val Phe Leu Asp Glu Ala Arg Lye Arg Gly Leu Thr Ala 
420 425 ~ 430 

gcc ate gtg gac gcg ggg aag ate etc ccc ata age cag ate ccc gag 1344 
Ala He Val Asp Ala Gly Lye He Leu Pro He Ser Gin lie Pro Glu 
435 440 445 

gag gcc tac gcc etc gcc tta gac etc ate tac gac cgc cgc aag gag 1392 
Glu Ala Tyr Ala Leu Ala Leu Asp Leu He Tyr Asp Arg Arg Lys Glu 
450 455 460 

ggc ttt gac ccc etc etc gcc ttc atg gcc tac ttt gag gcc cac aag 1440 
Gly Phe Asp Pro Leu Leu Ala Phe Met Ala Tyr Phe Glu Ala His Lys 
465 470 475 480 

gag gac ccg ggg aag agg gag gac gcc ttc ctg gcc ctt ccc ctt ctg 1488 
Glu Asp Pro Gly Lys Arg Glu Asp Ala Phe Leu Ala Leu Pro Leu Leu 
48S 490 495 

gag agg etc aag cgc cgc gtg gtg gag ggg agg aag cag ggc etc gag 1536 
Glu Arg Leu Lys Arg Arg Val Val Glu Gly Arg Lys Gin Gly Leu Glu 
500 505 510 

gcc gac ctg gag gag gcc ctg aag gcg ggg cac aag ccc ttg gac etc 1584 
Ala Asp Leu Glu Glu Ala Leu Lys Ala Gly His Lys Pro Leu Asp Leu 
515 520 " 525 

ate aac ggc ccc etc etc gcg ggg atg aag gag gtg ggg gac etc ttc 1632 
He Asn Gly Pro Leu Leu Ala Gly Met Lys Glu Val Gly Asp Leu Phe 
530 535 540 

ggg gcg ggg aag atg cag etc ccc ttc gtc etc cag gcc gcc gag gtg 1680 
Gly Ala Gly Lys Met Gin Leu Pro Phe Val Leu Gin Ala Ala Glu Val 
545 550 555 560 

atg aag egg gcg gtg gcc tac etc gag ccc cac atg gag aag aag ggg 1728 
Met Lys Arg Ala Val Ala Tyr Leu Glu Pro His Met Glu Lys Lys Gly 
565 570 575 

gag ggc aag ggt acc ctg gtc etc gcc acc gtc aag ggg gac gtg cac 1776 
Glu Gly Lys Gly Thr Leu Val Leu Ala Thr Val Lys Gly Asp Val His 
580 585 590 

gac ate ggc aag aac ctg gtg gac ate ate etc age aac aac ggc tac 1824 
Asp He Gly Lys Asn Leu Val Asp He He Leu Ser Asn Asn Gly Tyr 
595 600 60S 

egg gtg gtg aac ctg ggg ate aag gtg ccc att gag gag ate ctg aag 1872 
Arg Val Val Asn Leu Gly He Lys Val Pro He Glu Glu He Leu Lys 
610 615 620 
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gcc gtg gag gcg cac aag ccc cac gcc gtg ggc atg teg ggc etc ctg 1920 
Ala Val Glu Ala His Lys Pro His Ala Val Gly Met Ser Gly Leu Leu 
625 630 635 640 

gtg aag age ace ctg gtg atg aag gag aac ctg gag tac atg egg gat 1968 
Val Lys Ser Thr Leu Val Met Lys Glu Asn Leu Glu Tyr Met Arg Asp 
645 650 655 



agg ggc tac acc etc ccc gtg ate ctg ggc ggg gcc gcc etc acc egg 2016 
Arg Gly Tyr Thr Leu Pro Val He Leu Gly Gly Ala Ala Leu Thr Arg 
660 665 670 

age tac gtg gag gag ctt aag gcc ate tac ccc aac gtc tac tac gcc 2064 
Ser Tyr Val Glu Glu Leu Lye Ala He Tyr Pro Asn Val Tyr Tyr Ala 
675 680 685 

gag gac gcc ttt gag ggc tta agg etc atg gag gag etc acg ggc cac 2112 
Glu Asp Ala Phe Glu Gly Leu Arg Leu Met Glu Glu Leu Thr Gly His 
690 695 700 

gcc cct ccc gag etc acc egg aag gcc cca get agg ccc aag egg gag 2160 
Ala Pro Pro Glu Leu Thr Arg Lys Ala Pro Ala Arg Pro Lys Arg Glu 
705 710 715 720 

gcc ccc aag gtg gcg ccc cgc get egg ccc gtg ggg gag gcc ccc gcc 2208 
Ala Pro Lys Val Ala Pro Arg Ala Arg Pro Val Gly Glu Ala Pro Ala 
725 730 735 

gtc ccc egg ccc ccc ttc ttc ggc gtg egg gtg gag gaa ggc ttg gac 2256 
Val Pro Arg Pro Pro Phe Phe Gly Val Arg Val Glu Glu Gly Leu Asp 
740 745 750 

etc gcc acc ate gcc cac tac gtc aac aag etc gcc etc tac egg ggc 2304 
Leu Ala Thr He Ala His Tyr Val Asn Lys Leu Ala Leu Tyr Arg Gly 
755 760 765 

cag tgg ggc tac age cgc aag ggc ttt ccc ggg agg cgt ggc agg ccc 2352 
Gin Trp Gly Tyr Ser Arg Lys Gly Phe Pro Gly Arg Arg Gly Arg Pro 
770 775 780 

tgg tgg age ggg agg egg age ctg tct tec aga ggc tec tea agg agg 24 00 
Trp Trp Ser Gly Arg Arg Ser Leu Ser Ser Arg Gly Ser Ser Arg Arg 
785 790 795 800 

cga tgg egg aag ggt ggc ttg aac cca agg tec tct acg get tct tec 2448 
Arg Trp Arg Lys Gly Gly Leu Asn Pro Arg Ser Ser Thr Ala Ser Ser 
805 810 815 

ccg tgg ccc ggg agg gga gga get tct cgt ctt etc ccc aga gac ggg 2496 
Pro Trp Pro Gly Arg Gly Gly Ala Ser Arg Leu Leu Pro Arg Asp Gly 
820 825 830 

gga ggt get gga gcg ctt ccg ctt ccc ccg gca aag ggg egg ggg cct 2544 
Gly Gly Ala Gly Ala Leu Pro Leu Pro Pro Ala Lys Gly Arg Gly Pro 
835 B40 845 

gag cct cgt gga eta ctt ccg ccc ccg gtt tgc cgc gcc ttt ggg gga 2592 
Glu Pro Arg Gly Leu Leu Pro Pro Pro Val Cys Arg Ala Phe Gly Gly 
850 855 860 



cga ggc gga ctg gat gcc caa gga ggc ctt ccg ggc ggg ggc egg gac 
Arg Gly Gly Leu Asp Ala Gin Gly Gly Leu Pro Gly Gly Gly Arg Asp 



2640 
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870 



40 

875 



880 



gtc etc ggg gtc cag etc gtc ace atg ggg gag gee cct tec cga aag 
Val Leu Gly Val Gin Leu Val Thr Met Gly Glu Ala Pro Ser Arg Lye 
885 890 895 



2688 



gec cag gee etc ttt gcg tec ggg gec tac cag gac tac etc ttc gtc 
Ala Gin Ala Leu Phe Ala Ser Gly Ala Tyr Gin Asp Tyr Leu Phe Val 
900 905 910 



2736 



cac ggc ttc age gtg gag atg acc gag gec ttg gcg gag tac tgg cac 2784 
His Gly Phe Ser Val Glu Met Thr Glu Ala Leu Ala Glu Tyr Trp His 
915 920 925 

aag agg atg egg cag atg tgg ggc ate gec cac aag gac gec acc gag 2832 
Lys Arg Met Arg Gin Met Trp Gly lie Ala His Lys Asp Ala Thr Glu 
930 935 940 

ate cag aag etc ttc cag cag ggc tac cag ggg gee cgc tac tec ttc 2880 
lie Gin Lys Leu Phe Gin Gin Gly Tyr Gin Gly Ala Arg Tyr Ser Phe 
945 950 955 960 

ggc tac ccc gec tgc ccg gac etc gee gac cag gee aag ctg gac egg 2928 
Gly Tyr Pro Ala Cys- Pro Asp Leu Ala Asp Gin Ala Lys Leu Asp Arg 
965 970 975 

etc atg ggc ttc cac egg gtg ggg gtg cac etc acg gag aac ttc cag 2976 
Leu Met Gly Phe His Arg Val Gly Val His Leu Thr Glu Asn Phe Gin 
980 985 990 

ctg gag ccg gag cac gee acc age gec etc gtg gtc cac cac ccc gag 3024 
Leu Glu Pro Glu His Ala Thr Ser Ala Leu Val Val His His Pro Glu 
995 1000 1005 

gec cgc tac ttc age gtg gac tag 3048 
Ala Arg Tyr Phe Ser Val Asp 
1010 1015 



<210> 10 
<2U> 1015 
<212> PRT 

<213> Thermus thermophilus 
<400> 10 

Met Arg Ala Tyr Lys Glu Ala 
1 5 



Ala Arg Gly Leu Leu Lys Gly Gly Val 
10 15 



Asp Leu lie Leu Leu Glu Thr 
20 



Ala Gin Asp He Leu Gin Val Arg Cys 
25 30 



Ala Val Leu Ala Val Arg Glu 
35 



Ala Met Ala Glu Val Gly Arg Glu Val 
40 45 



Pro Leu Gin Val Gin Val Thr 
50 55 



Phe Glu Ala Thr Gly Thr Met Leu Val 
60 



Gly Thr Asp Glu Gin Ala Ala 
65 70 

Asp Val Val Gly Met Asn Cys 
85 



Leu Ala Ala Leu Glu Ser Leu Pro Val 
75 80 

Ala Thr Gly Pro Abp Leu Met Asp Ser 
90 95 
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Lys Val Arg Tyr Phe Ala Glu His Ser Thr Arg Phe Val Ser Cys Leu 
100 105 110 

Pro Asn Ala Gly Leu Pro Arg Abii Glu Gly Gly Arg Val Val Tyr Asp 
115 120 125 

Leu Thr Pro Glu Glu Leu Ala Lys Trp His Leu Lys Phe Val Ala Glu 
130 135 140 

Tyr Gly Val Asn Ala Val Gly Gly Cys Cys Gly Thr Gly Pro Glu His 
145 150 155 160 

lie Arg Lys Val Ala Glu Ala Val Lys Gly Leu Ala Pro Lys Pro Arg 
165 170 175 

Pro Glu Ser Phe Pro Pro Gin Val Ala Ser Leu Tyr Gin Ala Val Ser 
180 185 190 

Leu Lys Gin Glu Ala Ser Leu Phe Leu Val Gly Glu Arg Leu Asn Ala 
195 200 205 

Thr Gly Ser Lys Arg Phe Arg Glu Met Leu Phe Ala Arg Asp Leu Glu 
210 215 220 

Gly lie Leu Ala Leu Ala Arg Glu Gin Val Glu Glu Gly Ala His Ala 
225 230 235 240 

Leu Asp Leu Ser Val Ala Trp Thr Gly Arg Asp Glu Leu Glu Asp Leu 
245 250 255 

Arg Trp Leu Leu Pro His Leu Ala Thr Ala Leu Thr Val Pro Val Met 
260 265 270 

Val Asp Ser Thr Ser Pro Glu Ala Met Glu Leu Ala Leu Lys Tyr Leu 
275 280 285 

Pro Gly Arg Val Leu Leu Asn Ser Ala Asn Leu Glu Asp Gly Leu Glu 
290 295 300 

Arg Phe Asp Arg Val Ala Ser Leu Ala Lys Ala His Gly Ala Ala Leu 
305 310 315 320 

Val Val Leu Ala lie Asp Glu Lys Gly Met Ala Lys Thr Arg Glu Glu 
325 330 335 

Lys Val Arg Val Ala Leu Arg Met Tyr Glu Arg Leu Thr Glu His His 
340 345 350 

Gly Leu Arg Pro Glu Asp Leu Leu Phe Asp Leu Leu Thr Phe Pro lie 
355 360 365 

Thr Gin Gly Asp Glu Glu Ser. Arg Pro Leu Ala Lys Glu Thr Leu Leu 
370 375 380 

Ala lie Glu Glu Leu Arg Glu Arg Leu Pro Gly Val Gly Phe Val Leu 
385 390 395 400 

Arg Val Ser Asn Val Ser Phe Gly Leu Lys Pro Arg Ala Arg Arg Val 
405 410 415 

Leu Asn Ser Val Phe Leu Asp Glu Ala Arg Lys Arg Gly Leu Thr Ala 
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420 425 430 

Ala lie Val Asp Ala Gly Lye He Leu Pro He Ser Gin He Pro Glu 
435 440 445 

Glu Ala Tyr Ala Leu Ala Leu Asp Leu He Tyr Asp Arg Arg Lys Glu 
450 455 460 

Gly Phe Asp Pro Leu Leu Ala Phe Met Ala Tyr Phe Glu Ala His Lys 
465 470 475 480 

Glu Asp Pro Gly Lys Arg Glu Asp Ala Phe Leu Ala Leu Pro Leu Leu 
485 490 495 

Glu Arg Leu Lys Arg Arg Val Val Glu Gly Arg Lys Gin Gly Leu Glu 
500 505 510 

Ala Asp Leu Glu Glu Ala Leu Lys Ala Gly His Lys Pro Leu Asp Leu 
515 520 525 

He Asn Gly Pro Leu Leu Ala Gly Met Lys Glu Val Gly Asp Leu Phe 
530 535 540 

Gly Ala Gly Lys Met Gin Leu Pro Phe Val Leu Gin Ala Ala Glu Val 
545 550 555 560 

Met Lys Arg Ala Val Ala Tyr Leu Glu Pro His Met Glu Lys Lys Gly 
565 570 575 

Glu Gly Lys Gly Thr Leu Val Leu Ala Thr Val Lys Gly Asp Val His 
580 585 590 

Asp He Gly Lys Asn Leu Val Asp He He Leu Ser Asn Asn Gly Tyr 
595 600 605 

Arg Val Val Asn Leu Gly He Lys Val Pro He Glu Glu He Leu Lys 
610 615 620 

Ala Val Glu Ala His Lys Pro His Ala Val Gly Met Ser Gly Leu Leu 
625 630 635 640 

Val Lys Ser Thr Leu Val Met Lys Glu Asn Leu Glu Tyr Met Arg Asp 
645 650 655 

Arg Gly Tyr Thr Leu Pro Val He Leu Gly Gly Ala Ala Leu Thr Arg 
660 665 670 

Ser Tyr Val Glu Glu Leu Lys Ala He Tyr Pro Asn Val Tyr Tyr Ala 
675 680 685 

Glu Asp Ala Phe Glu Gly Leu Arg Leu Met Glu Glu Leu Thr Gly His 
690 695 700 

Ala Pro Pro Glu Leu Thr Arg Lys Ala Pro Ala Arg Pro Lys Arg Glu 
705 710 715 ~ 720 

Ala Pro Lys Val Ala Pro Arg Ala Arg Pro Val Gly Glu Ala Pro Ala 
725 730 735 

Val Pro Arg Pro Pro Phe Phe Gly Val Arg Val Glu Glu Gly Leu Asp 
740 745 750 



WO 03/087386 PCT/EP03/04010 

43 

Leu Ala Thr He Ala His Tyr Val Asn Lys Leu Ala Leu Tyr Arg Gly 
755 760 765 

Gin Trp Gly Tyr Ser Arg Lys Gly Phe Pro Gly Arg Arg Gly, Arg Pro 
770 775 780 

Trp Trp Ser Gly Arg Arg Ser Leu Ser Ser Arg Gly Ser Ser Arg Arg 
785 790 795 800 

Arg Trp Arg Lys Gly Gly Leu Asn Pro Arg Ser Ser Thr Ala Ser Ser 
80S 810 815 

Pro Trp Pro Gly Arg Gly Gly Ala Ser Arg Leu Leu Pro Arg Asp Gly 
820 825 830 

Gly Gly Ala Gly Ala Leu Pro Leu Pro Pro Ala Lys Gly Arg Gly Pro 
835 840 845 

Glu Pro Arg Gly Leu Leu Pro Pro Pro Val Cys Arg Ala Phe Gly Gly 
850 855 860 

Arg Gly Gly Leu Asp Ala Gin Gly Gly Leu Pro Gly Gly Gly Arg Asp 
865 870 875 880 

Val Leu Gly Val Gin Leu Val Thr Met Gly Glu Ala Pro Ser Arg Lys 
885 690 895 

Ala Gin Ala Leu Phe Ala Ser Gly Ala Tyr Glh Asp Tyr Leu Phe Val 
900 90S 910 

His Gly Phe Ser Val Glu Met Thr Glu Ala Leu Ala Glu Tyr Trp His 
915 920 925 

Lys Arg Met Arg Gin Met Trp Gly He Ala His Lys Asp Ala Thr Glu 
930 935 940 

He Gin Lys Leu Phe Gin Gin Gly Tyr Gin Gly Ala Arg Tyr Ser Phe 
345 950 955 960 

Gly Tyr Pro Ala Cys Pro Asp Leu Ala Asp Gin Ala Lys Leu Asp Arg 
965 970 975 

Leu Met Gly Phe His Arg Val Gly Val His Leu Thr Glu Asn Phe Gin 
980 985 990 

Leu Glu Pro Glu His Ala Thr Ser Ala Leu Val Val His His Pro Glu 
995 1000 1005 

Ala Arg Tyr Phe Ser Val Asp 
1010 1015 



<210> 11 
<211> 3441 
<212> DNA 

<213> Bacillus halodurans 

<220> 
<221> CDS 
<222> (1) . . (3438) 
<223> RHD05550 



f 
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<400> 11 

atg act aaa teg ttg ttt gaa caa cag tta gag cga aaa ate gtc ate 48 
Met Thr Lys Ser Leu Phe Glu Gin Gin Leu Glu Arg LyB He Val He 
1 5 10 " 15 

ctt gat ggg gcg atg ggg ace atg tta caa gec gcg aat eta acc get 96 
Leu Asp Gly Ala Met Gly Thr Met Leu Gin Ala Ala Asn Leu Thr Ala 
20 25 30 

gat gac ttt ggc gga gaa gag tat gaa ggg tgt aat gaa tat tta aat 144 
Asp Asp Phe Gly Gly Glu Glu Tyr Glu Gly Cys Asn Glu Tyr Leu Asn 
35 40 45 

gag acg gee ccc cat gtc gtt gag gac att cat cgc gca tac tta gag 192 
Glu Thr Ala Pro His Val Val Glu Asp He His Arg Ala Tyr Leu Glu 
50 55 60 

gca gga gca gac gtc att gcg acg aac acg ttc ggg gca aca gat ate 240 
Ala Gly Ala Asp Val He Ala Thr Asn Thr Phe Gly Ala Thr Asp He 
65 70 75 80 

gtt ctt gac gat tat gat etc gga tac aaa gca gag gag tta aac ata 288 
Val Leu Asp Asp Tyr Asp Leu Gly Tyr Lys Ala Glu Glu Leu Asn He 
85 90 95 

tgc gcg gtg aaa ate get aaa cgt gta get gaa gag ttt tec act cca 336 
Cys Ala Val Lys He Ala Lys Arg Val Ala Glu Glu Phe Ser Thr Pro 
100 105 no 

gat tgg cct cga ttc gtt gca ggg gcg atg ggg ccg acg acg aaa tct 384 
Asp Trp Pro Arg Phe Val Ala Gly Ala Met Gly Pro Thr Thr Lys Ser 
115 120 125 

ctt tec gtc aca ggg ggc gcg aca ttc gaa caa ctt ate gag tct tat 432 
Leu Ser Val Thr Gly Gly Ala Thr Phe Glu Gin Leu He Glu Ser Tyr 
130 135 140 

cgc cag caa get aca ggt eta att aaa ggc ggg gcg gat att tta tta 480 
Arg Gin Gin Ala Thr Gly Leu He Lys Gly Gly Ala Asp He Leu Leu 
1*5 150 155 160 

etc gaa acg age cag gat atg cga aac gtg aag gcg get tat tta gga 528 
Leu Glu Thr Ser Gin Asp Met Arg Asn Val Lys Ala Ala Tyr Leu Gly 
165 170 175 

ctg age caa gcg caa aaa gag eta gag gtg aaa ctg cct etc att att 576 
Leu Ser Gin Ala Gin Lys Glu Leu Glu Val Lys Leu Pro Leu He He 
180 185 190 

tct gga acg att gaa ccg atg gga aca acg etc gec ggc caa aac ate 624 
Ser Gly Thr He Glu Pro Met Gly Thr Thr Leu Ala Gly Gin Asn He 
195 200 205 

gag gcg ttc tat ttg tea tta gag cat atg aat ccc gtc gtt gtc ggt 672 
Glu Ala Phe Tyr Leu Ser Leu Glu His Met Asn Pro Val Val Val Gly 
210 215 220 

etc aac tgc get aca gga cca gaa ttt atg cgc gat cac etc cgt tct 720 
Leu Asn Cys Ala Thr Gly Pro Glu Phe Met Arg Asp His Leu Arg Ser 
225 230 235 240 

ctt tea gac ctt gcg acc tgc tct gta age tgt tat ccg aat get ggg 768 
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Leu Ser Asp Leu Ala Thr Cys Ser Val Ser Cys Tyr Pro Asn Ala Oly 
245 250 255 

tta cct gat gaa gag ggg aac tat cac gaa tec cca gaa tea, tta gca 816 
Leu' Pro Asp Glu Qlu Gly Asn Tyr His Glu Ser Pro Glu Ser Leu Ala 
260 265 270 

gec aag etc gca ggt ttt gcg gaa aag ggc tgg ttg aat atg gtt ggt 864 
Ala Lys Leu Ala Gly Phe Ala Glu Lys Gly Trp Leu Asn Met Val Gly 
275 280 285 

ggc tgt tgc ggg acg act cca gac cac att cgt get ctt ttg gac gtt 912 
Gly Cys Cys Gly Thr Thr Pro Asp His lie Arg Ala Leu Leu Asp Val 
290 295 300 

atg aag caa ttt gag ccg aga caa cca aaa ggg gat cac ccc cac teg 960 
Met Lys Gin Phe Glu Pro Arg Gin Pro Lys Gly Asp His Pro His Ser 
305 310 315 32€ 

gtc tea gga att gag cca ctg tta tac gat gac age atg cgt cca eta 1008 
Val Ser Gly He Glu Pro Leu Leu Tyr Asp Asp Ser Met Arg Pro Leu 
325 330 335 

ttt gtc ggt gaa egg aca aac gtc ate ggg tct cgt aaa ttt aaa egg 1056 
Phe Val Gly Glu Arg Thr Asn Val He Gly Ser Arg Lys Phe Lys Arg 
340 345 350 

ttg ate gaa gaa gaa aaa tat gaa gaa gee tea gaa att gca. aga tec 1104 
Leu He Glu Glu Glu Lys Tyr Glu Glu Ala Ser Glu He Ala Arg Ser 
355 360 365 

caa gtg aag aaa ggg gee cac gtt ate gat gtt tgt ctt get gat ccg 1152 
Gin Val Lys Lys Gly Ala His Val He Asp Val Cys Leu Ala Asp Pro 
370 375 380 

gat cgc gat gaa atg gag gac atg gag gaa ttt tta aaa ttc gtg ate 1200 
Asp Arg Asp Glu Met Glu Asp Met Glu Glu Phe Leu Lys Phe Val He 
385 390 395 400 

aac aaa gtg aag gta ccg etc atg att gac tec acc gac gaa aag gta 1248 
Asn Lys Val Lys Val Pro Leu Met He Asp Ser Thr Asp Glu Lys Val 
405 410 415 

att gaa caa gcg ctt acg tat tea caa ggg aaa gcg ate att aat teg 1296 
He Glu Gin Ala Leu Thr Tyr Ser Gin Gly Lys Ala He He Asn Ser 
420 425 430 

ate aac tta gag gac ggc gaa gaa cgt ttt gaa aaa gtg gtc ccg etc 1344 
lie Asn Leu Glu Asp Gly Glu Glu Arg Phe Glu Lys Val Val Pro Leu 
435 440 445 

gtc cat aag tat gga gee gcg gtt gtc gtt ggt acg ate gac gaa gaa 1392 
Val His Lys Tyr Gly Ala Ala Val Val Val Gly Thr He Asp Glu Glu 
450 455 460 

gga atg gcg att acg gca gaa aaa aaa tta gcg gtt gcg aaa cga tea 1440 
Gly Met Ala He Thr Ala Glu Lys Lys Leu Ala Val Ala Lys Arg Ser 
465 470 475 * 480 

tac gac ctg etc gta aac aaa tac aac att cgt ccg age gat att att 14 88 
Tyr Asp Leu Leu Val Asn Lys Tyr Asn He Arg Pro Ser Asp He He 
485 490 495 
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ttt gat ccg etc gtg ttc cca gta gga aca ggc gat gag caa tac att 1536 
Phe Asp Pro Leu Val Phe Pro Val Gly Thr Gly Asp Glu Gin Tyr lie 
500 505 510 

ggc teg gcg aat gag acg gtg gaa gga att agg agg ate aaa gaa gag 1584 
Gly ser Ala Asn Glu Thr Val Glu Gly lie Arg Arg lie Lys Glu Glu 
515 520 525 

etc cct gaa tgt tta acg att ctt gga gtt agt aac gtg teg ttc ggt 1632 
Leu Pro Glu Cye Leu Thr He Leu Gly Val Ser Asn Val Ser Phe Gly 
530 535 540 

ctt ccg cct gtc gga aga gag gtg ctg aac gcg gcg tac tta tac cat 1680 
Leu Pro Pro Val Gly Arg Glu Val Leu Asn Ala Ala Tyr Leu Tyr His 
545 550 555 560 

tgt aca caa get ggc ctt gat tac get ate gtg aac aca gaa aag ctt 1728 
Cya Thr Gin Ala Gly Leu Asp Tyr Ala He Val Asn Thr Glu Lys Leu 
565 570 575 

gag cgt tat gee teg att tct gat gaa gaa aaa gaa ttg tea agg aag 1776 
Glu Arg Tyr Ala Ser He Ser Asp Glu Glu Lys Glu Leu Ser Arg Lys 
580 585 590 

etc tta ttt gaa acg aca gat gaa acg etc get gag ttc acc gee ttt 1824 
Leu Leu Phe Glu Thr Thr Asp Glu Thr Leu Ala Glu Phe Thr Ala Phe 
595 600 605 

tat cga ggg aaa aaa gca gag aaa aaa gtg gag act tct aat tta act 1872 
Tyr Arg Gly Lys Lys Ala Glu Lys Lys Val Glu Thr Ser Asn Leu Thr 
610 615 620 

ttg gaa gag egg ttg gca aac tac att gtt gaa ggg tea aag gac gga 1920 
Leu Glu Glu Arg Leu Ala Asn Tyr He Val Glu Gly Ser Lys Asp Gly 
625 630 635 640 

ctg aca gaa gat tta gat aaa gcg etc gcg aaa tat gat gat ccg ctt 1968 
Leu Thr Glu Asp Leu Asp Lys Ala Leu Ala Lys Tyr Asp Asp Pro Leu 
645 650 655 

gat ate att aac ggc ccg etc atg aat gga atg gac gaa gtc ggt cgt 2016 
Asp lie lie Asn Gly Pro Leu Met Asn Gly Met Asp Glu Val Gly Arg 
660 665 670 

ttg ttt aac aat aac gag ctt att gtc get gaa gta ttg caa age get 2064 
Leu Phe Asn Asn Asn Glu Leu He Val Ala Glu Val Leu Gin Ser Ala 
675 680 685 

gag gtt atg aag get tec gtc gee cac ctt gag cca cat atg gaa aag 2112 
Glu Val Met Lys Ala Ser Val Ala His Leu Glu Pro His Met Glu Lys 
690 695 700 

aaa gca gac gat cat gga aaa gga aaa ate att ctt gec acg gtc aag 2160 
Lys Ala Asp Asp His Gly Lys Gly Lys lie lie Leu Ala Thr Val Lys 
705 710 715 720 

ggc gat gtt cac gat ate ggg aaa aat eta gtg gaa att att ttg age 2208 
Gly Asp Val His Asp lie Gly Lys Asn Leu Val Glu He lie Leu Ser 
725 730 735 



aat aat ggt ttc cgc ate gtg aac eta gga att aaa gtt acc tct aat 



2256 
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Asn Asn Gly Phe Arg He Val Asn Leu Gly He Lys Val Thr Ser Asn 
740 745 750 

gag ctg att gaa gcg gtg gcg aga gaa aat cca gat gcg att ggc ttg 2304 
Glu Leu He Glu Ala Val Ala Arg Glu Asn Pro Asp Ala He - Gly Leu 
755 760 765 

tea ggg ttg etc gtc aaa tea gca caa caa atg gta ctt ace gee caa 2352 
Ser Gly Leu Leu Val Lye Ser Ala Gin Gin Met Val Leu Thr Ala Gin 
770 775 780 

gat ttg aag caa caa caa att tec att ccg att tta gtc gga ggc gca 2400 
Asp Leu Lys Gin Gin Gin He Ser He Pro He Leu Val Gly Gly Ala 
7B5 790 795 800 

gec ctt acg egg aaa ttt acg aat aca aaa ate get cca gag tat gat 2448 
Ala Leu Thr Arg Lys Phe Thr Asn Thr Lys He Ala Pro Glu Tyr Asp 
805 610 815 

ggt etc gtc gtc tac gcg aag gat gcg atg aac ggg tta gag ctt gec 2496 
Gly Leu Val Val Tyr Ala Lys Asp Ala Met Asn Gly Leu Glu Leu Ala 
820 825 830 

aat aaa tta atg aaa cct gat gaa cga gaa aag eta gcg gtc tec etc 2544 
Asn Lys Leu Met Lys Pro Asp Glu Arg Glu Lys Leu Ala Val Ser Leu 
835 840 845 

cat gaa gcg aag gag cag gcg aac teg agg aca caa atg gga gga ggc 2592 
His Glu Ala Lys Glu Gin Ala Asn Ser Arg Thr Gin Met Gly Gly Gly 
850 855 860 

gga act gca gtt gcg gta aag ccg act cga tec cat gtt teg aca acg 2640 
Gly Thr Ala Val Ala Val Lys Pro Thr Arg Ser His Val Ser Thr Thr 
865 870 875 880 

gtg cct gta gcg gtc cca cct gat gtg aag ccg cac att ttg cgc cac 2688 
Val Pro Val Ala Val Pro Pro Asp Val Lys Pro His He Leu Arg His 
885 890 895 

cat age att gec cat tta gag ccg tat att aac atg cag atg ttg tta 2736 
HiB Ser He Ala His Leu Glu Pro Tyr He Asn Met Gin Met Leu Leu 
900 905 910 

gga cgt cac tta ggc tta caa ggg aaa gtg age cgc ctg ctt gca gaa 2784 
Gly Arg His Leu Gly Leu Gin Gly LyB Val Ser Arg Leu Leu Ala Glu 
315 920 925 

aaa gac gag aag get ctt gaa tta aaa gaa aaa gtt gat gcg eta etc 2832 
Lys Asp Glu Lys Ala Leu Glu Leu Lys Glu Lys Val Asp Ala Leu Leu 
930 935 940 

ace agg gtg aaa gag gag cag etc atg gaa gee cat ggc atg tat cag 2880 
Thr Arg Val Lys Glu Glu Gin Leu Met Glu Ala His Gly Met Tyr Gin 
945 950 955 * 960 

ttt ttt cct gee cag teg gat ggg gac gat att gtc att tat gat caa 2928 
Phe Phe Pro Ala Gin Ser Asp Gly Asp Asp He Val He Tyr Asp Gin 
965 970 975 

acg gga aca aat gaa ate gag cga ttc cat ttt ccg cgt cag aat aag 2976 
Thr Gly Thr Asn Glu He Glu Arg Phe His Phe Pro Arg Gin Asn Lys 
980 985 990 



I 
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gag cct tat ctg tgt ctt gcc gat ttc ctt cgc cca gtt tec agt ggg 3024 
Glu Pro Tyr Leu Cys Leu Ala Asp Phe Leu Arg Pro Val Ser Ser Gly 
995 1000 1005 

gaa atg gac tat gtt ggc ttc ctt get gta ace gca gga aaa ggc att 3072 
Glu Met Asp Tyr Val Gly Phe Leu Ala Val Thr Ala Gly Lys Gly He 
1010 1015 1020 

cgt gaa tta ggg gag cag gcg aaa gag get gga gac tat tta ttc agt 3120 
Arg Glu Leu Gly Glu Gin Ala Lys Glu Ala Gly Asp Tyr Leu Phe Ser 
1Q 25 1030 1035 1040 

cac tta ate caa gca aca gcc tta gag atg gcg gaa ggg ttt gcc gag 3168 
His Leu He Gin Ala Thr Ala Leu Glu Met Ala Glu Gly Phe Ala Glu 
1045 1050 1055 

cgt gtc cat cag etc atg cgt gat aag tgg ggg ttt cct gat teg get 3216 
Arg Val His Gin Leu Met Arg Asp Lys Trp Gly Phe Pro Asp Ser Ala 
1060 1065 1070 

gac ttt aca atg gaa gag cgt ttc get gca aaa tac cgt ggc ate cgt 3264 
Asp Phe Thr Met Glu Glu Arg Phe Ala Ala Lys Tyr Arg Gly He Arg 
1075 1080 " 1085 

gta teg ttt ggc tac cct gca tgc cct gac ttg gat gac caa gca aag 3312 
Val Ser Phe Gly Tyr Pro Ala Cys Pro Asp Leu Asp Asp Gin Ala Lys 
1090 1095 HOO 



ttg ttt aag ctg ttg aag cct gga aag ate gga att gag ttg acg gaa 3360 
Leu Phe Lys Leu Leu Lys Pro Gly Lys He Gly He Glu Leu Thr Glu 
1105 IHO 1115 1120 

ggg ttt atg atg gag cca gaa gcc tec gtc ace gcg atg gtg ttt gcc 3408 
Gly Phe Met Met Glu Pro Glu Ala Ser Val Thr Ala Met Val Phe Ala 
1125 H30 H35 



cat cct gag get cgc tat ttt aat gtt tta tag 
Hie Pro Glu Ala Arg Tyr Phe Asn Val Leu 
1140 H45 



3441 



<210> 12 
<211> 1146 
<212> PRT 

<213> Bacillus halodurans 
<400> 12 

Met Thr Lys Ser Leu Phe Glu Gin Gin Leu Glu Arg Lys He Val He 
1 5 10 is 

Leu Asp Gly Ala Met Gly Thr Met Leu Gin Ala Ala Asn Leu Thr Ala 
20 25 30 

Asp Asp Phe Gly Gly Glu Glu Tyr Glu Gly Cys Asn Glu Tyr Leu Asn 
35 40 45 

Glu Thr Ala Pro His Val Val Glu Asp He His Arg Ala Tyr Leu Glu 
50 55 60 

Ala Gly Ala Asp Val lie Ala Thr Asn Thr Phe Gly Ala Thr Asp He 
65 70 75 80 
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Val Leu Asp Asp Tyr Asp Leu Gly Tyr Lys Ala Glu Glu Leu Asn He 
85 90 95 

Cyd Ala Val Lys He Ala Lys Arg Val Ala Glu Glu Phe Ser' Thr Pro 
100 105 no 

Asp Trp Pro Arg Phe Val Ala Gly Ala Met Gly Pro Thr Thr Lys Ser 
115 120 125 

Leu Ser Val Thr Gly Gly Ala Thr Phe Glu Gin Leu He Glu Ser Tyr 
130 135 140 

Arg Gin Gin Ala Thr Gly Leu He Lys Gly Gly Ala Asp He Leu Leu 
"5 150 155 160 

Leu Glu Thr Ser Gin Asp Met Arg Asn Val Lys Ala Ala Tyr Leu Gly 
165 170 175 

Leu Ser Gin Ala Gin Lys Glu Leu Glu Val Lys Leu Pro Leu He He 
180 185 190 

Ser Gly Thr He Glu Pro Met Gly Thr Thr Leu Ala Gly Gin Asn He 
195 200 205 

Glu Ala Phe Tyr Leu Ser Leu Glu His Met Asn Pro Val Val Val Gly 
210 215 220 

Leu Asn Cys Ala Thr Gly Pro Glu Phe Met Arg Asp His Leu Arg Ser 
225 230 235 240 

Leu Ser Asp Leu Ala Thr Cys Ser Val Ser Cye Tyr Pro Asn Ala Gly 
245 250 255 

Leu Pro Asp Glu Glu Gly Asn Tyr His Glu Ser Pro Glu Ser Leu Ala 
260 265 270 

Ala Lys Leu Ala Gly Phe Ala Glu Lye Gly Trp Leu Asn Met Val Gly 
275 280 285 

Gly Cys Cys Gly Thr Thr Pro Asp His He Arg Ala Leu Leu Asp Val 
290 295 300 

Met Lys Gin Phe Glu Pro Arg Gin Pro Lys Gly Asp His Pro His Ser 
305 310 315 320 

Val Ser Gly He Glu Pro Leu Leu Tyr Asp Asp Ser Met Arg Pro Leu 
325 330 335 

Phe Val Gly Glu Arg Thr Asn Val He Gly Ser Arg Lys Phe Lys Arg 
340 345 350 

Leu He Glu Glu Glu Lys Tyr Glu Glu Ala Ser Glu He Ala Arg Ser 
355 360 365 

Gin Val Lys Lys Gly Ala His Val He Asp Val Cys Leu Ala Asp Pro 
370 375 380 

Asp Arg Asp Glu Met Glu Asp Met Glu Glu Phe Leu Lys Phe Val He 
385 390 395 4 00 

Asn Lys Val Lys Val Pro Leu Met He Asp Ser Thr Asp Glu Lys Val 
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405 410 415 

He Glu Gin Ala Leu Thr Tyr Ser Gin Gly Lys Ala He He Asn Ser 
420 425 430 

He Asn Leu Glu Asp Gly Glu Glu Arg Phe Glu Lys Val Val Pro Leu 
435 440 445 

Val His Lys Tyr Gly Ala Ala Val Val Val Gly Thr He Asp Glu Glu 
450 455 460 

Gly Met Ala He Thr Ala Glu Lys Lys Leu Ala Val Ala Lys Arg Ser 
465 470 475 460 

Tyr Asp Leu Leu Val Asn Lys Tyr Asn lie Arg Pro Ser Asp He lie 
485 490 495 

Phe Asp Pro Leu Val Phe Pro Val Gly Thr Gly Asp Glu Gin Tyr lie 
500 505 510 

Gly Ser Ala Asn Glu Thr Val Glu Gly lie Arg Arg He Lys Gly Glu 
515 520 " 525 

Leu Pro Glu Cys Leu Thr lie Leu Gly Val Ser Asn Val Ser Phe Gly 
530 535 540 

Leu Pro Pro Val Gly Arg Glu Val Leu Asn Ala Ala Tyr Leu Tyr His 
545 550 555 560 

Cys Thr Gin Ala Gly Leu Asp Tyr Ala He Val Asn Thr Glu Lys Leu 
565 570 575 

Glu Arg Tyr Ala Ser lie Ser Asp Glu Glu Lys Glu Leu Ser Arg Lys 
5B0 5B5 590 

Leu Leu Phe Glu Thr Thr Asp Glu Thr Leu Ala Glu Phe Thr Ala Phe 
595 600 605 

Tyr Arg Gly Lys Lys Ala Glu Lys Lys Val Glu Thr Ser Asn Leu Thr 
610 615 620 

Leu Glu Glu Arg Leu Ala Asn Tyr lie Val Glu Gly Ser Lys Asp Gly 
625 630 635 640 

Leu Thr Glu Asp Leu Asp Lys Ala Leu Ala Lys Tyr Asp Asp Pro Leu 
645 650 655 

Asp lie lie Asn Gly Pro Leu Met Asn Gly Met Asp Glu Val Gly Arg 
660 665 ^ 670 

Leu Phe Asn Asn Asn Glu Leu lie Val Ala Glu Val Leu Gin Ser Ala 
675 680 685 

Glu Val Met Lys Ala Ser Val Ala His Leu Glu Pro His Met Glu Lys 
690 695 700 

Lys Ala Asp Asp His Gly Lys Gly Lys He lie Leu Ala Thr Val Lys 
705 710 715 720 

Gly Asp Val Hie Asp He Gly Lys Asn Leu Val Glu He lie Leu Ser 
725 730 735 
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Asn Asn Gly Phe Arg He Val Asn Leu Gly He Lys Val Thr Ser Asn 
740 745 750 

Glu Leu He Glu Ala Val Ala Arg Glu Asn Pro Asp Ala He, Gly Leu 
755 760 765 

Ser Gly Leu Leu Val Lys Ser Ala Gin Gin Met Val Leu Thr Ala Gin 
770 775 780 

Asp Leu Lys Gin Gin Gin He Ser He . Pro He Leu Val Gly Gly Ala 
785 790 795 800 

Ala Leu Thr Arg LyB Phe Thr Asn Thr Lys He Ala Pro Glu Tyr Asp 
805 810 815 

Gly Leu Val Val Tyr Ala Lys Asp Ala Met Asn Gly Leu Glu Leu Ala 
820 825 830 

Asn Lys Leu Met Lys Pro Asp Glu Arg Glu Lys Leu Ala Val Ser Leu 
835 840 ~ 845 

His Glu Ala Lys Glu Gin Ala Asn Ser Arg Thr Gin Met Gly Gly Gly 
850 855 860 

Gly Thr Ala Val Ala Val Lys Pro Thr Arg Ser His Val Ser Thr Thr 
865 870 875 880 

Val Pro Val Ala Val Pro Pro Asp Val Lys Pro His He Lex* Arg His 
885 890 895 

His Ser He Ala His Leu Glu Pro Tyr He Asn Met Gin Met Leu Leu 
900 905 910 

Gly Arg His Leu Gly Leu Gin Gly Lys Val Ser Arg Leu Leu Ala Glu 
915 920 925 

Lys Asp Glu Lys Ala Leu Glu Leu Lys Glu Lys Val Asp Ala Leu Leu 
930 935 940 

Thr Arg Val Lys Glu Glu Gin Leu Met Glu Ala His Gly Met Tyr Gin 
945 950 955 960 

Phe Phe Pro Ala Gin Ser Asp Gly Asp Asp He Val He Tyr Asp Gin 
965 970 975 

Thr Gly Thr Asn Glu He Glu Arg Phe His Phe Pro Arg Gin Asn Lys 
980 985 990 

Glu Pro Tyr Leu Cys Leu Ala Asp Phe Leu Arg Pro Val Ser Ser Gly 
995 1000 1005 

Glu Met Asp Tyr Val Gly Phe Leu Ala Val Thr Ala Gly Lys Gly He 
1010 1015 1020 

Arg Glu Leu Gly Glu Gin Ala Lys Glu Ala Gly Asp Tyr Leu Phe Ser 
1025 1030 1035 1040 

His Leu He Gin Ala Thr Ala Leu Glu Met Ala Glu Gly Phe Ala Glu 
1045 1050 1055 

Arg Val HiB Gin Leu Met Arg Asp Lys Trp Gly Phe Pro Asp Ser Ala 
1060 1065 1070 
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Asp Phe Thr Met Glu Glu Arg Phe Ala Ala Lys Tyr Arg Qly lie Arg 
1075 1080 1085 

Val Ser Phe Gly Tyr Pro Ala Cys Pro Asp Leu Asp Asp Gin Ala Lys 
1090 1095 lioo 

Leu Phe Lys Leu Leu Lys Pro Gly Lys lie Gly He Glu Leu Thr Glu 
H05 1110 1H5 H20 

Gly Phe Met Met Glu Pro Glu Ala Ser Val Thr Ala Met Val Phe Ala 
1125 H30 H35 

His Pro Glu Ala Arg Tyr Phe Asn Val Leu 
1140 1145 



.<210> 13 
<211> 3411 
<212> DNA 

<213> Bacillus stearothermophilus 

<220> 
<221> CDS 
<222> (1) (3408) 
<223> RBE02044 

<400> 13 

atg get aac gtc acc tta gaa cag caa ctg caa aga aaa att ctt gtc 48 

Met Ala Asn Val Thr Leu Glu Gin Gin Leu Gin Arg Lys He Leu Val 
1 5 10 15 

ate gat ggc gee atg ggc acg atg ate caa age gee aac eta teg gee 96 
He Asp Gly Ala Met Gly Thr Met He Gin Ser Ala Asn Leu Ser Ala 
20 25 30 

gee gac ttt ggc ggc gag gcg tat gaa ggg tgc aac gaa tat ttg acc 144 
Ala Asp Phe Gly Gly Glu Ala Tyr Glu Gly Cys Asn Glu Tyr Leu Thr 
35 40 * 45 

etc acc gee ccg cat gtc ate cgc cgc att cat gaa gcg tac eta gaa 192 
Leu Thr Ala Pro His Val He Arg Arg He His Glu Ala Tyr Leu Glu 
50 55 60 

gee ggt get gat ate att gaa acg aac acg ttc gga gcg aca cgc ate 240 
Ala Gly Ala Asp He He Glu Thr Asn Thr Phe Gly Ala Thr Arg He 
65 70 75 80 

gtg ctt gac gaa tat ggc etc ggt cat ttg gcg ctt gag ctg aac ate 288 
Val Leu Asp Glu Tyr Gly Leu Gly His Leu Ala Leu Glu Leu Asn lie 
85 90 95 

gaa gcg gec aaa etc gee aaa caa acg get gag teg ttc tec acc ccg 336 
Glu Ala Ala Lys Leu Ala Lys Gin Thr Ala Glu Ser Phe Ser Thr Pro 
!00 105 no 

gac tgg ccg cgc ttt gtc gee ggt teg atg ggg ccg acg acg aaa acg 384 
Asp Trp Pro Arg Phe Val Ala Gly Ser Met Gly Pro Thr Thr Lys Thr 
115 120 125 

ttg teg gtg aca ggc ggc gca acg ttt gaa gaa etc gtc gec gee tac 432 
Leu Ser Val Thr Gly Gly Ala Thr Phe Glu Glu Leu Val Ala Ala Tyr 
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130 135 UO 

gaa gaa caa gcg cgc gga ctg etc tta gga ggc gtc gac ctt etc eta 4 80 
Glu Glu Gin Ala Arg Gly Leu Leu Leu Gly Gly Val Asp Leu, Leu Leu 
145 150 155 160 

etc gag acg tgc caa gat acg ctg aat gtc aaa gee ggt ttt etc ggc 528 
Leu Glu Thr CyB Gin Asp Thr Leu Aan Val Lys Ala Gly Phe Leu Gly 
165 170 175 . 

att teg aag gcg ttt gaa gcg gtc ggc cgc cgc gtg ccg etc atg att 576 
lie Ser Lys Ala Phe Glu Ala Val Gly Arg Arg Val Pro Leu Met He 
180 185 ^ 190 

tec ggc acg ate gaa ccg atg ggc acg acg etc gee ggg cag gcg ate 624 
Ser Gly Thr He Glu Pro Met Gly Thr Thr Leu Ala Gly Gin Ala He 
195 200 205 

gat gcg ttt ttc ate teg gtg cgc cat atg aag ccg ate gee gtc ggc 672 
Asp Ala Phe Phe lie Ser Val Arg His Met Lys Pro He Ala Val Gly 
210 215 220 

tta aac tgc gca acc ggt ccg gag ttt atg ace gac cat ttg cgc acg 720 
Leu Asn Cys Ala Thr Gly Pro Glu Phe Met Thr Asp His Leu Arg Thr 
225 230 235 240 

etc gec teg etc get gac acg gcg gtc age tgc tac ccg aac gec ggt 768 
Leu Ala Ser Leu Ala Asp Thr Ala Val Ser Cys Tyr Pro Asn Ala Gly 
245 250 255 

ctg ccg gat gag gaa ggc cac tat cat gaa acg ccg aat atg ctg gca 816 
Leu Pro Asp Glu Glu Gly His Tyr His Glu Thr Pro Asn Met Leu Ala 
260 265 270 

gag aaa ate cgc cgc ttt gee gaa aag gga tgg ate aac ate gtc ggc 864 
Glu Lys He Arg Arg Phe Ala Glu Lys Gly Trp He Asn He Val Gly 
275 280 285 

ggg tgt tgc ggc acg acg ccg gat cat ate cgc gee att get gaa gcg 912 
Gly Cys Cys Gly Thr Thr Pro Asp His He Arg Ala He Ala Glu Ala 
290 295 300 

gtg cgt gat etc ccg ccg egg gcg att ccg tct teg ttt gat gtc cac 960 
Val Arg Asp Leu Pro Pro Arg Ala He Pro Ser Ser Phe Asp Val His 
305 310 315 320 

gee gtt tec ggc ate gag gcg etc ate tat gat gaa acg atg cgc ccg 1008 
Ala Val Ser Gly He Glu Ala Leu He Tyr Asp Glu Thr Met Arg Pro 
325 330 335 

etc ttt gtc ggc gag egg aca aac gtg ate ggc teg cgc aaa ttc aag 1056 
Leu Phe Val Gly Glu Arg Thr Asn Val lie Gly Ser Arg Lys Phe Lys 
340 345 350 

cgc etc ate gee gaa ggg aaa tac gaa gaa gcg gcg gaa ate gec cgc 1104 
Arg Leu He Ala Glu Gly Lys Tyr Glu Glu Ala Ala Glu He Ala Arg 
355 360 365 

gee caa gtg aaa aac ggc gee cat gtc ate gac att tgc etc gec gac 1152 
Ala Gin Val Lys Asn Gly Ala His Val He Asp He Cys Leu Ala Asp 
370 375 380 
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cca gac cgc gac gaa etc cat gac atg gag cag ttc gtc cgc gaa gtc 1200 

Pro Asp Arg Asp Glu Leu His Asp Met Glu Gin Phe Val Arg Glu Val 
385 390 395 400 

gtg aaa aaa gtg aaa gtg ccg ctt gtc ate gat teg ace gac gag cgc 1248 
Val Lys Lys Val Lys Val Pro Leu Val He Asp Ser Thr Asp Glu Arg 
405 410 415 

gtc ate gaa cgc gec ctt acg tat teg caa ggg aag gcg ate ate aac 1296 
Val lie Glu Arg Ala Leu Thr Tyr Ser Gin Gly Lys Ala He He Asn 
420 425 430 

teg ate aac etc gaa gat ggc gaa gag egg ttt gcg aag gtc gtt cct 1344 
Ser He Asn Leu Glu Asp Gly Glu Glu Arg Phe Ala Lys Val Val Pro 
435 440 445 

etc ctg cat caa tac ggc gee gee gtt gtc gtc ggc acg ate gat gag 1392 
Leu Leu His Gin Tyr Gly Ala Ala Val Val Val Gly Thr He Asp Glu 
450 455 460 

caa gga atg gcg gtt aca gee gaa egg aaa ttg gaa ate gec ttg cgt 1440 
Gin Gly Met Ala Val Thr Ala Glu Arg Lys Leu Glu He Ala Leu Arg 
465 470 475 480 

teg tat gac ttg ctg gtg aac cgc tac ggc gtc ccc gag cgc gac ate 1488 
Ser Tyr Asp Leu Leu Val Asn Arg Tyr Gly Val Pro Glu Arg Asp He 
485 490 495 

att ttc gac ccg etc gtc ttc ccg gtc ggc ace ggc gat gag caa tac 1536 
He Phe Asp Pro Leu Val Phe Pro Val Gly Thr Gly Asp Glu Gin Tyr 
500 505 510 

ate ggc gcg gcg aaa gaa ace att gag ggc ate cgc etc att aaa gag 1584 
He Gly Ala Ala Lys Glu Thr He Glu Gly He Arg Leu He Lys Glu 
515 520 525 

egg ctg cct cat tgc ttg acg atg ctt ggc ate age aac gtc teg ttc 1632 
Arg Leu Pro His Cys Leu Thr Met Leu Gly He Ser Asn Val Ser Phe 
530 535 540 

ggc ttg ccg ccg gee gga cgc gag gtg etc aac tec gtc ttt ttg tac 1680 
Gly Leu Pro Pro Ala Gly Arg Glu Val Leu Asn Ser Val Phe Leu Tyr 
545 550 555 560 

cat tgc acg caa gee ggg etc gat tac gee ate gtc aac acc gag aaa 1728 
His Cys Thr Gin Ala Gly Leu Asp Tyr Ala He Val Asn Thr Glu Lys 
565 570 575 

ttg gag egg ttc gee teg att ccg gaa gag gaa gtg cga atg get gag 1776 
Leu Glu Arg Phe Ala Ser He Pro Glu Glu Glu Val Arg Met Ala Glu 
580 585 590 



gca ctt ctt ttt gac aca aac gac gaa aca tta aac gee ttt ate gaa 
Ala Leu Leu Phe Asp Thr Asn Asp Glu Thr Leu Asn Ala Phe He Glu 
595 600 605 



1824 



ttt tac cga age aaa ate acc gee gec aaa ccg gcg cag acg aac ttg 1872 
Phe Tyr Arg Ser Lys He Thr Ala Ala Lys Pro Ala Gin Thr Asn Leu 
610 615 620 

age ttg gaa gag egg etc gee cgc tac gtt att gaa ggg teg aaa gac 1920 
Ser Leu Glu Glu Arg Leu Ala Arg Tyr Val He Glu Gly Ser LyB Asp 
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625 630 635 640 

ggg etc att etc gat ttg gaa aag gcg ctt gag acc tac tec gat ccg 1968 
Gly Leu He Leu Asp Leu Glu Lys Ala Leu Glu Thr Tyr Ser Asp Pro 
645 650 * 655 

ctg tec ate ate aac ggt ccg etc atg gcc ggc atg gat gaa gtc ggg 2016 
Leu Ser He He Asn Gly Pro Leu Met Ala Gly Met Asp Glu Val Gly 
660 665 670 

egg ctg ttc aac aac aac cag etc ate gtc get gaa gta ttg caa age 2064 
Arg Leu Phe Asn Asn Asn Gin Leu He Val Ala Glu Val Leu Gin Ser 
675 680 685 

gcg gaa gtg atg aaa gca gcg gtc gcc ttt tta gag ctg tat atg gaa 2112 
Ala Glu Val Met Lys Ala Ala Val Ala Phe Leu Glu Leu Tyr Met Glu 
690 695 700 

aag aaa gaa gga age aca aaa gga aaa gtc att etc gcc acc gtc aaa 2160 
Lys Lys Glu Gly Ser Thr Lys Gly Lys Val He Leu Ala Thr Val Lys 
705 710 715 720 

ggc gat gtg cat gac ate ggc aaa aac ttg gtc gac ate att tta age 2208 
Gly Asp Val His Asp He Gly Lys Asn Leu Val Asp He He Leu Ser 
725 730 735 

aac aac ggc tac gag gtg ate gac etc ggc att aaa gtc get ccg cag 2256 
Asn Asn Gly Tyr Glu Val He Asp Leu Gly He Lys Val Ala Pro Gin 
740 745 750 

caa etc att gaa gcg gtg cgc gaa cat cag ccg gac ate ate ggg ttg 2304 
Gin Leu He Glu Ala Val Arg Glu His Gin Pro Asp He lie Gly Leu 
755 760 765 

teg ggc ttg ctt gtg aaa teg get caa cag atg gtc gtc acc gcc caa 2352 
Ser Gly Leu Leu Val Lys Ser Ala Gin Gin Met Val Val Thr Ala Gin 
770 775 780 

gac ttg cgc caa gcg ggc ate teg acc ccg att tta gtc ggc ggc gcc 2400 
Asp Leu Arg Gin Ala Gly He Ser Thr Pro He Leu Val Gly Gly Ala 
785 790 795 " 800 

gcc ttg acg cgc aaa ttt acg gaa aac aaa ate gcg ccc gag tac gac 2448 
Ala Leu Thr Arg Lys Phe Thr Glu Asn Lys He Ala Pro Glu Tyr Asp 
805 810 815 

ggc gtt gtc ttg tac gcg aaa gac gcc atg gac ggg etc gcc ctt gcc 2496 
Gly Val Val Leu Tyr Ala Lys Asp Ala Met Asp Gly Leu Ala Leu Ala 
820 825 830 

aac caa ate cag cag ggc gag att gac tac aag aaa aaa gaa acg gcc 2544 
Asn Gin He Gin Gin Gly Glu He Asp Tyr Lys Lys Lys Glu Thr Ala 
835 840 * 845 

gaa age gag cca acg egg caa acg acg gtg gtc aca gcg gtc aaa teg 2592 
Glu Ser Glu Pro Thr Arg Gin Thr Thr Val Val Thr Ala Val Lys Ser 
850 855 860 

acc gtc teg acc gac gtt ccc gtc tac ate ccg gcc gat etc gag cgc 2640 
Thr Val Ser Thr Asp Val Pro Val Tyr He Pro Ala Asp Leu Glu Arg 
865 870 875 880 
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cac gcg ctg cga aat gtg ccg ctt gac cac att ttg ccg tac gtc aac 2688 
His Ala Leu Arg Asn Val Pro Leu Asp His He Leu Pro Tyr Val Asn 
885 890 895 

tgg caa atg gtg etc ggc cac cac etc ggc ttg aaa gga aaa gtg aaa 2736 
Trp Gin Met Val Leu Gly His Hie Leu Gly Leu Lys Gly Lys Val Lye 
900 905 * 910 

egg ctg ctt gaa gag aaa gac gaa aaa gcg ttg gcg tta aaa gcg gtc 2784 
Arg Leu Leu Glu Glu Lys Asp Glu Lys Ala Leu Ala Leu Lys Ala Val 
915 920 925 

gtc gac gaa ctg etc gec gaa gcg aaa gag cgc cgc tgg att cag ccc 2832 
Val Asp Glu Leu Leu Ala Glu Ala Lys Glu Arg Arg Trp He Gin Pro 
930 935 940 

gec ggc gtc tac cgc ttc ttc ccg gcg caa age gac ggc aac egg gtt 2880 
Ala Gly Val Tyr Arg Phe Phe Pro Ala Gin Ser Asp Gly Asn Arg Val 
945 950 955 * 960 

tac att tac gat ccg act gac ggc aaa aca gtg etc gag atg ttc gac 2928 
Tyr He Tyr Asp Pro Thr Asp Gly Lys Thr Val Leu Glu Met Phe Asp 
965 970 975 

ttt ccg cgc caa ccg egg gcg ccg tat ctt tgc etc gee gat tat ttg 2976 
Phe Pro Arg Gin Pro Arg Ala Pro Tyr Leu Cys Leu Ala Asp Tyr Leu 
980 985 990 

aaa teg aaa gaa age ggc gaa atg gat tac gtc ggt ttg ttc gee gtc 3024 
Lys Ser Lys Glu Ser Gly Glu Met Asp Tyr Val Gly Leu Phe Ala Val 
995 1000 1005 

acc get ggg cat ggc gtc cgc gaa etc gee cag cgc tgg aag gaa gaa 3072 
Thr Ala Gly His Gly Val Arg Glu Leu Ala Gin Arg Trp Lys Glu Glu 
1010 1015 1020 

ggc gaa ttt ttg aaa age cat gee ate caa gcg ttg gcg etc gag att 3120 
Gly Glu Phe Leu Ly B Ser His Ala He Gin Ala Leu Ala Leu Glu He 
102 5 1030 1035 1040 

gec gaa ggg ttc gee gaa cga ate cat caa att atg cgc gac cgc tgg 3168 
Ala Glu Gly Phe Ala Glu Arg He His Gin He Met Arg Asp Arg Trp 
1045 1050 1055 

ggc ttc ccg gac gac ccg gat ttc acg atg gaa gag cgc ttc gee gee 3216 
Gly Phe Pro Asp Asp Pro Asp Phe Thr Met Glu Glu Arg Phe Ala Ala 
1060 1065 1070 



aaa tac cag ggc cag cgc tac teg ttc ggc tac ccg gee tgt ccg aac 3264 
Lys Tyr Gin Gly Gin Arg Tyr Ser Phe Gly Tyr Pro Ala Cys Pro Asn 
1075 1080 1085 

ttg gaa gac cag gag aaa ctg ttc cgt ctg ctt cat cca gaa gac ate 3312 
Leu Glu Asp Gin Glu Lys Leu Phe Arg Leu Leu His Pro Glu Asp He 
1090 1095 iioo 

ggc ate cgt etc acc gac ggc tat atg atg gaa ccc gaa gca teg gtt 3360 
Gly He Arg Leu Thr Asp Gly Tyr Met Met Glu Pro Glu Ala Ser Val 
H05 1110 ins n2o 

teg gcg ate gtc ttc gec cat ccg gaa gcg egg tat ttc aat gtg tta 3408 
Ser Ala He Val Phe Ala His Pro Glu Ala Arg Tyr Phe Asn Val Leu 
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1125 



1130 



1135 



taa 



3411 



<210> 14 
<211> 1136 
<212> PRT 

<213> Bacillus stearothermophllus 
<400> 14 

Met Ala Asn Val Thr Leu Glu Gin Gin Leu Gin Arg Lye lie Leu Val 
1 5 10 is 

lie Asp Gly Ala Met Gly Thr Met He Gin Ser Ala Asn Leu Ser Ala 



Ala Asp Phe Gly Gly Glu Ala Tyr Glu Gly Cys Asn Glu Tyr Leu Thr 
35 40 45 

Leu Thr Ala Pro His Val He Arg Arg He HIb Glu Ala Tyr Leu Glu 
50 55 60 

Ala Gly Ala Asp He He Glu Thr Asn Thr Phe Gly Ala Thr Arg He 
65 70 75 BO 

Val Leu Asp Glu Tyr Gly Leu Gly His Leu Ala Leu Glu Leu Asn He 
85 90 95 

Glu Ala Ala Lys Leu Ala Lye Gin Thr Ala Glu Ser Phe Ser Thr Pro 
100 105 HO 

Asp Trp Pro Arg Phe Val Ala Gly Ser Met Gly Pro Thr Thr Lys Thr 
115 120 125 

Leu Ser Val Thr Gly Gly Ala Thr Phe Glu Glu Leu Val Ala Ala Tyr 
130 135 140 

Glu Glu Gin Ala Arg Gly Leu Leu Leu Gly Gly Val Asp Leu Leu Leu 
145 150 155 160 

Leu Glu Thr Cys Gin Asp Thr Leu Asn Val Lys Ala Gly Phe Leu Gly 
165 170 175 

He Ser Lys Ala Phe Glu Ala Val Gly Arg Arg Val Pro Leu Met He 
180 185 190 

Ser Gly Thr He Glu Pro Met Gly Thr Thr Leu Ala Gly Gin Ala He 
195 200 205 

Asp Ala Phe Phe He Ser Val Arg His Met Lys Pro He Ala Val Gly 
210 215 220 

Leu ABn Cys Ala Thr Gly Pro Glu Phe Met Thr Asp His Leu Arg Thr 
225 230 235 240 

Leu Ala Ser Leu Ala Asp Thr Ala Val Ser Cy8 Tyr Pro Asn Ala Gly 
245 250 * 255 

Leu Pro Asp Glu Glu Gly His Tyr His Glu Thr Pro Asn Met Leu Ala 



20 



25 



30 



260 



265 



270 
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Glu Lye He Arg Arg Phe Ala Glu Lys Gly Trp He Asn He Val Gly 
275 280 * 285 

Gly Cys Cys Gly Thr Thr Pro Asp His lie Arg Ala He Ala Glu Ala 
290 295 300 

Val Arg Asp Leu Pro Pro Arg Ala He Pro Ser Ser Phe Asp Val His 
305 310 315 320 

Ala Val Ser Gly He Glu Ala Leu He Tyr Asp Glu Thr Met Arg Pro 
325 330 335 

Leu Phe Val Gly Glu Arg Thr Ash Val He Gly Ser Arg Lys Phe Lys 
340 345 350 

Arg Leu He Ala Glu Gly Lys Tyr Glu Glu Ala Ala Glu He Ala Arg 
355 360 365 

Ala Gin Val Lys Asn Gly Ala His Val He Asp He Cys Leu Ala Asp 
370 375 380 

Pro Asp Arg Asp Glu Leu His Asp Met Glu Gin Phe Val Arg Glu Val 
385 390 395 400 

Val Lys Lys Val Lys Val Pro Leu Val He Asp Ser Thr Asp Glu Arg 
405 410 415 

Val He Glu Arg Ala Leu Thr Tyr Ser Gin Gly Lys Ala He He Asn 
420 425 430 

Ser He Asn Leu Glu Asp Gly Glu Glu Arg Phe Ala Lys Val Val Pro 
435 440 445 

Leu Leu His Gin Tyr Gly Ala Ala Val Val Val Gly Thr He Asp Glu 
450 455 460 

Gin Gly Met Ala Val Thr Ala Glu Arg Lys Leu Glu He Ala Leu Arg 
465 470 475 480 

Ser Tyr Asp Leu Leu Val Asn Arg Tyr Gly Val Pro Glu Arg Asp He 
485 490 495 

He Phe Asp Pro Leu Val Phe Pro Val Gly Thr Gly Asp Glu Gin Tyr 
500 50S 510 

He Gly Ala Ala Lys Glu Thr He Glu Gly lie Arg Leu He Lys Glu 
515 520 525 

Arg Leu Pro His Cys Leu Thr Met Leu Gly He Ser Asn Val Ser Phe 
530 535 540 

Gly Leu Pro Pro Ala Gly Arg Glu Val Leu Asn Ser Val Phe Leu Tyr 
545 550 555 560 

His Cys Thr Gin Ala Gly Leu Asp Tyr Ala He Val Asn Thr Glu Lys 
565 570 575 

Leu Glu Arg Phe Ala Ser He Pro Glu Glu Glu Val Arg Met Ala Glu 
580 585 590 

' Ala Leu Leu Phe Asp Thr Asn Asp Glu Thr Leu Asn Ala Phe He Glu 
-595 600 605 
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Phe Tyr Arg Ser Lye lie Thr Ala Ala Lys Pro Ala Gin Thr Aan Leu 
610 615 620 

Ser Leu Glu Glu Arg Leu Ala Arg Tyr Val He Glu Gly Ser LyB Asp 
625 630 635 • 640 

Gly Leu He Leu Asp Leu Glu Lya Ala Leu Glu Thr Tyr Ser Asp Pro 
645 650 655 

• > i< 
Leu Ser He He Asn Gly Pro Leu Met Ala Gly Met Asp Glu Val Gly 
660 665 670 

Arg Leu Phe Asn Asn Asn Gin Leu He Val Ala Glu Val Leu Gin Ser 
675 6B0 685 

Ala Glu Val Net Lys Ala Ala Val Ala Phe Leu Glu Leu Tyr Met Glu 
690 695 700 

Lys Lys Glu Gly Ser Thr Lys Gly Lys Val He Leu Ala Thr Val Lys 
705 710 715 720 

Gly Asp Val His Asp He Gly Lys Asn Leu Val Asp He He Leu Ser 
725 730 735 

Asn Asn Gly Tyr Glu Val He Asp Leu Gly He Lys Val Ala Pro Gin 
740 745 750 

Gin Leu He Glu Ala Val Arg Glu His Gin Pro Asp He lie Gly Leu 
755 760 765 

Ser Gly Leu Leu Val Lys Ser Ala Gin Gin Met Val Val Thr Ala Gin 
770 775 780 

Asp Leu Arg Gin Ala Gly He Ser Thr Pro He Leu Val Gly Gly Ala 
785 790 795 800 

Ala Leu Thr Arg Lys Phe Thr Glu Asn Lys He Ala Pro Glu Tyr Asp 
805 810 815 

Gly Val Val Leu Tyr Ala Lys Asp Ala Met Asp Gly Leu Ala Leu Ala 
820 825 830 

Asn Gin He Gin Gin Gly Glu He Asp Tyr Lys Lys Lys Glu Thr Ala 
835 840 845 

Glu Ser Glu Pro Thr Arg Gin Thr Thr Val Val Thr Ala Val Lys Ser 
850 855 860 

Thr Val Ser Thr Asp Val Pro Val Tyr lie Pro Ala Asp Leu Glu Arg 
865 870 875 880 

His Ala Leu Arg Asn Val Pro Leu Asp His He Leu Pro Tyr Val Asn 
885 890 895 

Trp Gin Met Val Leu Gly His His Leu Gly Leu Lys Gly Lys Val Lys 
900 905 910 

Arg Leu Leu Glu Glu Lys Asp Glu Lys Ala Leu Ala Leu Lys Ala Val 
915 920 925 

Val Asp Glu Leu Leu Ala Glu Ala Lys Glu Arg Arg Trp He Gin Pro 
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930 935 940 

Ala Gly Val Tyr Arg Phe Phe Pro Ala Gin Ser Asp Gly Asn Arg Val 
94 S 950 955 * 960 

Tyr He Tyr Asp Pro Thr Asp Gly Lys Thr Val Leu Glu Met Phe Asp 
965 970 975 

Phe Pro Arg Gin Pro Arg Ala Pro Tyr Leu Cys Leu Ala Asp Tyr Leu 
960 985 990 

Lys Ser Lys Glu Ser Gly Glu Met A8p Tyr Val Gly Leu Phe Ala Val 
995 1000 . 1005 

Thr Ala Gly His Gly Val Arg Glu Leu Ala Gin Arg Trp Lys Glu Glu 
1010 1015 1020 

Gly Glu Phe Leu Lys Ser His Ala He Gin Ala Leu Ala Leu Glu He 
102 5 1030 1035 1040 

Ala Glu Gly Phe Ala Glu Arg He His Gin He Met Arg Asp Arg Trp 
1045 1050 " 1055 

Gly Phe Pro Asp Asp Pro Asp Phe Thr Met Glu Glu Arg phe Ala Ala 
1060 1065 1070 

Lys Tyr Gin Gly Gin Arg Tyr Ser Phe Gly Tyr Pro Ala Cys Pro Asn 
1075 1080 1085 

Leu Glu Asp Gin Glu Lys Leu Phe Arg Leu Leu His Pro Glu Asp He 
1090 1095 iioo 

Gly He Arg Leu Thr Asp Gly Tyr Met Met Glu Pro Glu Ala Ser Val 
1105 mo ins H20 

Ser Ala He Val Phe Ala His Pro Glu Ala Arg Tyr Phe Asn Val Leu 
1125 H30 1135 



<210> 15 
<211> 3681 
<212> DNA 

<213> Vibrio cholerae 

<220> 
<221> CDS 
<222> (1) (3678) 
<223> RVC04265 

<400> 15 

gtg gga aaa gaa gta aga caa caa etc gaa cag caa ttg aaa caa cgt 48 

val Gly Lye Glu Val Arg Gin Gin Leu Glu Gin Gin Leu Lys Gin Arg 
15 io is 

ate eta ctg att gat ggt ggt atg ggt acc atg att cag agt tat aag 96 
He Leu Leu He Asp Gly Gly Met Gly Thr Met He Gin Ser Tyr Lys 
20 25 30 



tta caa gag gaa gac tat cgc ggt gca cga ttt gtc gat tgg cac tgt 144 



WO 03/087386 PCT/EP03/04010 

61 

Leu Gin Glu Glu Asp Tyr Arg Gly Ala Arg Phe Val Asp Trp His Cys 
35 40 45 

gat ttg aaa gga aat aac gac etc tta gtg ctt act cag ccg, caa att 192 

Asp Leu Lys Gly Asn Asn Asp Leu Leu Val Leu Thr Gin Pro' Gin He 
50 55 60 



att aaa gag att cac tec get tac ctt gaa gcg ggg gcg gat att ctt 
He Lys Glu He His Ser Ala Tyr Leu Glu Ala Gly Ala Asp He Leu 
65 70 75 



80 



240 



gag ace aac ace ttt aac tea acc acg att gee atg gca gac tat gac 288 
Glu Thr Asn Thr Phe Asn Ser Thr Thr He Ala Met Ala Asp Tyr Asp 
85 90 95 

atg caa teg etc agt get gaa att aac ttt gee gcg get aag ctt gca 336 
Met Gin Ser Leu Ser Ala Glu He Asn Phe Ala Ala Ala Lys Leu Ala 
100 105 no 

cgt gaa gtc gcg gat gag tgg acg get aaa gat cca agt egg cca cgc 384 
Arg Glu Val Ala Asp Glu Trp Thr Ala Lys Asp Pro Ser Arg Pro Arg 
115 120 125 

tat gtg get ggt gtg ctt ggg cca acc aac cgt act tgc tct att teg 432 
Tyr Val Ala Gly Val Leu Gly Pro Thr Asn Arg Thr Cys Ser He Ser 
130 135 140 

cca gat gtg aac gat cca gga ttt cgt aac gtc act ttt gat ggg ctt 480 
Pro Asp Val Asn Asp Pro Gly Phe Arg Asn Val Thr Phe Asp Gly Leu 
"5 150 155 * 160 

gtt gaa gee tat tec gaa teg acg cgc get ttg ate aaa ggt ggc age 528 
Val Glu Ala Tyr Ser Glu Ser Thr Arg Ala Leu He Lys Gly Gly Ser 
165 170 175 

gat ctg ate etc att gaa acc ate ttc gat aca ctt aac gee aaa gec 576 
Asp Leu He Leu He Glu Thr He Phe Asp Thr Leu Asn Ala Lys Ala 
180 185 190 

tgt gcg ttt gcg gtc gat age gta ttt gaa gag ctg ggc ate age tta 624 
Cys Ala Phe Ala Val Asp Ser Val Phe Glu Glu Leu Gly He Ser Leu 
195 200 205 

cct gtg atg att tec ggc acg att acc gat gee tct ggg cga act ctg 672 
Pro Val Met He Ser Gly Thr He Thr Asp Ala Ser Gly Arg Thr Leu 
210 215 220 

tea gga cag aca acg gaa get ttc tac aac gee ttg cgt cat gta egg 720 
Ser Gly Gin Thr Thr Glu Ala Phe Tyr Asn Ala Leu Arg His Val Arg 
225 230 235 240 

ccg att teg ttt ggc ttg aac tgt gcg tta ggt cct gat gag ctg cgc 768 
Pro He Ser Phe Gly Leu Asn Cys Ala Leu Gly Pro Asp Glu Leu Arg 
245 250 255 

cag tac gtg gaa gag ctt tea cgc att tea gaa tgc tat gtt tec gcg 816 
Gin Tyr Val Glu Glu Leu Ser Arg He Ser Glu Cys Tyr Val Ser Ala 
260 265 " 270 

cac cca aat gee gga ctg ccc aat gcg ttt ggt gaa tac gat etc tct 864 
His Pro Asn Ala Gly Leu Pro Asn Ala Phe Gly Glu Tyr Asp Leu Ser 
275 280 285 
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gcc gag gaa atg gca gaa cat att gcg gaa tgg gca caa get ggc ttt 912 
Ala Glu Glu Met Ala Glu His He Ala Glu Trp Ala Gin Ala Gly Phe 
290 295 300 

ttg aat ttg gtc ggt ggt tgc tgt gga act aca cct gag cat ate gcc 960 
Leu Asn Leu Val Gly Gly Cys Cys Gly Thr Thr Pro Glu His He Ala 
305 310 315 320 

gcc att gcc aaa gcc gtc gag ggt gta aaa cca agg get ctg cca gat 1008 
Ala He Ala Lys Ala Val Glu Gly Val Lys Pro Arg Ala Leu Pro Asp 
325 330 335 

ctg aaa gta gaa tgt cgt etc teg ggt tta gag ccg etc aat att ggt 1056 
Leu Lys Val Glu Cys Arg Leu Ser Gly Leu Glu Pro Leu Asn He Gly 
340 345 350 

cct gaa ace ttg ttt gtt aac gtg ggc gaa cgt act aac gtc ace ggt 1104 
Pro Glu Thr Leu Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly 
355 360 365 

tct gcg cgt ttt aag cgt tta att aaa gaa gag caa tac gac gaa gcg 1152 
Ser Ala Arg Phe Lys Arg Leu He Lys Glu Glu Gin Tyr Asp Glu Ala 
370 375 380 

etc gat gtg gcg cgt gag caa gtc gaa aac ggc gcg cag ate att gat 1200 
Leu Asp Val Ala Arg Glu Gin Val Glu Asn Gly Ala Gin He He Asp 
385 390 395 400 

ate aac atg gat gaa ggc atg ttg gac gcc gag gcg tgt atg gtg cgc 1248 
He Asn Met Asp Glu Gly Met Leu Asp Ala Glu Ala Cys Met Val Arg 
405 410 415 

ttt ttg aat eta tgc gcc tct gaa cca gaa ata tec aaa gtt ccg gtg 1296 
Phe Leu Asn Leu Cys Ala Ser Glu Pro Glu He Ser Lys Val Pro Val 
420 425 430 

atg gtc gac tec tct aaa tgg gaa gtc att gaa gcg ggt ctg aaa tgc 1344 
Met Val Asp Ser Ser Lys Trp Glu Val He Glu Ala Gly Leu Lys Cys 
435 440 445 

att cag ggt aaa ggc ate gtc aac tct ate tct eta aaa gaa ggg aaa 1392 
He Gin Gly Lys Gly He Val Asn Ser He Ser Leu LyB Glu Gly Lys 
450 455 460 

gag aag ttt att gcc caa gcc aaa ttg gtg cgc cgc tac ggt gcc gcg 1440 
Glu Lys Phe He Ala Gin Ala Lys Leu Val Arg Arg Tyr Gly Ala Ala 
465 470 475 " 480 

gtg att gtg atg gca ttt gac gaa gtg ggc caa gcc gat ace cgt gag 14 88 
Val He Val Met Ala Phe Asp Glu Val Gly Gin Ala Asp Thr Arg Glu 
485 490 495 

cgc aaa tta gag ate tgt cgt egg get tac cat att ttg gtc gat gag 1536 
Arg Lys Leu Glu He Cys Arg Arg Ala Tyr His He Leu Val Asp Glu 
500 505 510 

gtg ggc ttc cca ccg gaa gat att att ttt gac ccg aac ate ttt get 1584 
Val Gly Phe Pro Pro Glu Asp He He Phe Asp Pro Asn He Phe Ala 
515 520 525 

gtt gcg ace gga att gat gag cac aat aac tac gca ctg gat ttc att 1632 
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Val Ala Thr Gly He Asp Glu His Asn Asn Tyr Ala Leu Asp Phe He 
530 535 540 

aat gca gtg gcg gac att aag cgt gag ctg ccg cat gcg atg. att tct 1680 
Asn Ala Val Ala Asp He Lys Arg Glu Leu Pro His Ala Net He Ser 
545 550 555 560 

ggc ggt gtt tct aac gtt tec ttc tct ttc cgc ggc aac aac tat gtg 1728 
Gly Gly Val Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Asn Tyr Val 
565 570 , 575 

cgt gaa gcg ate cat get gtt ttc ctt tat cac tgc ttc aaa cac ggc 1776 
Arg Glu Ala He His Ala Val Phe Leu Tyr His Cys Phe Lys His Gly 
580 585 590 

atg gac atg ggg att gtc aac gca ggg cag ctt gaa ate tac gat aac 1824 
Net Asp Net Gly He Val Asn Ala Gly Gin Leu Glu He Tyr Asp Asn 
595 600 605 

gtt ccg ctg aaa ctg cgt gag gca gtg gaa gat gtg ate etc aat cga 1872 
Val Pro Leu Lys Leu Arg Glu Ala Val Glu Asp Val He Leu Asn Arg 
610 615 620 

cgt age gat ggc acg gaa aga ctg ctt gag ate gee gaa gcg tat cgc 1920 
Arg Ser Asp Gly Thr Glu Arg Leu Leu Glu He Ala Glu Ala Tyr Arg 
625 630 635 640 

gaa aac agt gtt ggt aaa gaa gag gat get tct gca tta gag tgg cgc 1968 
Glu Asn Ser Val Gly Lys Glu Glu Asp Ala Ser Ala Leu Glu Trp Arg 
645 650 655 

gca tgg cct gtg get aag cgc eta gag cac get tta gtc aaa ggc ate 2016 
Ala Trp Pro Val Ala Lys Arg Leu Glu His Ala Leu Val Lys Gly He 
660 665 670 

ace gaa ttt ate gtc caa gac act gaa gaa gca cgt cag caa gec agt 2064 
Thr Glu Phe He Val Gin Asp Thr Glu Glu Ala Arg Gin Gin Ala Ser 
675 660 685 

aaa cca ctg gaa gtg att gaa ggg ccg ctg atg gat ggt atg aac gtg 2112 
Lys Pro Leu Glu Val He Glu Gly Pro Leu Met Asp Gly Met Asn Val 
690 695 700 

gtc ggt gac ttg ttc ggg gaa ggg aaa atg ttc eta ccg caa gtc gta 2160 
Val Gly Asp Leu Phe Gly Glu Gly Lys Met Phe Leu Pro Gin Val Val 
705 710 715 720 

aaa tea gcg cgt gtc atg aaa caa gec gtt gcg tat ctt gag cct ttc 2208 
Lys Ser Ala Arg Val Met Lys Gin Ala Val Ala Tyr Leu Glu Pro Phe 
725 730 735 

att aat gcg caa aaa agt ggt age act tea aat ggt aag att ttg ctg 2256 
He Asn Ala Gin Lys Ser Gly Ser Thr Ser Asn Gly Lys He Leu Leu 
740 745 750 

gcg ace gta aaa ggc gat gtg cat gac att ggt aag aac att gtt ggc 2304 
Ala Thr Val Lys Gly Asp Val His Asp He Gly Lys Asn He Val Gly 
755 760 765 



gtc gtg ctg cag tgt aat aac ttc gag ate ate gat ctt ggt gtg atg 
val Val Leu Gin Cys Asn Asn Phe Glu He He Asp Leu Gly Val Met 
770 775 780 



2352 



I 
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gtg cct tgc gag cag ate etc aaa gtc gca cgc gag caa aat gtc gat 
Val Pro Cys Glu Gin He Leu Lys Val Ala Arg Glu Gin Asn Val Asp 
785 730 795 800 



2400 



ate ate ggt etc tct ggg ctt ate acg ccg tct ttg gat gag atg gta 
He He Gly Leu Ser Gly Leu He Thr Pro Ser Leu Asp Glu Met Val 
805 810 ~ 815 



2448 



cac gtg gcg aaa gag atg gag cga caa ggg ttt gaa ctg cca ctt ttg 
His Val Ala Lys Glu Met Glu Arg Gin Gly Phe Glu Leu Pro Leu Leu 
820 825 830 



2496 



att ggt ggg gca aca acg tct aaa gcg cat act gcg gtg aag att gaa 
He Gly Gly Ala Thr Thr Ser Lys Ala HiB Thr Ala Val Lys He Glu 
835 840 845 



2544 



cag aat tat cat gcg cct gta gtg tac gtg aat aac gcg teg cgc gcg 
Gin Asn Tyr His Ala Pro Val Val Tyr Val Asn Asn Ala Ser Arg Ala 
850 855 860 



2592 



gta ggg gtg tgc aca tea tta ttg tct gat gaa cag cgc ccc gga ttt 
Val Gly Val Cys Thr Ser Leu Leu Ser Asp Glu Gin Arg Pro Gly Phe 
865 870 875 880 



2640 



ate gaa cgt ttg gat etc gat tat gag cgc acg cgt gat cag cat get 
He Glu Arg Leu Asp Leu Asp Tyr Glu Arg Thr Arg Asp Gin His Ala 
885 890 * 895 



2688 



cgt aaa acg ccc aaa teg cgc cca gtc acg tta gag cag gca cgt get 
Arg Lys Thr Pro Lys Ser Arg Pro Val Thr Leu Glu Gin Ala Arg Ala 
900 905 910 



2736 



aat aaa gcg gcg ctg gat tgg gca aat tac acg ccg ccc get cct gcg 
Asn Lys Ala Ala Leu Asp Trp Ala Asn Tyr Thr Pro Pro Ala Pro Ala 
915 920 925 



2784 



aaa ccg ggt gtg cat gtg ttt gaa aac att gcg tta gee aca eta cgt 
Lys Pro Gly Val His Val Phe Glu Asn He Ala Leu Ala Thr Leu Arg 
930 935 940 



2832 



cct tat ate gat tgg acg cct ttt ttt atg act tgg teg ctt atg ggc 
Pro Tyr He Asp Trp Thr Pro Phe Phe Met Thr Trp Ser Leu Met Gly 
945 950 955 960 



2880 



aaa tac cct gee att ttg gag cat gaa gag gtc ggt gaa gag gee aaa 
Lys Tyr Pro Ala He Leu Glu His Glu Glu Val Gly Glu Glu Ala Lys 
965 970 975 



2928 



cgt ctg ttt cat gat gee aat gec tta ctt gat aaa gta gag cga gaa 2976 

Arg Leu Phe His Asp Ala ABn Ala Leu Leu Asp Lys Val Glu Arg Glu 

980 985 990 

gga eta ctg aaa gee agt ggt atg tgt gca ctg ttt cca gca gee age 3024 

Gly Leu Leu Lys Ala Ser Gly Met Cys Ala Leu Phe Pro Ala Ala Ser 

995 1000 1005 



gtg ggc gat gac att gag gtg tac agt gat gaa teg cgt acg caa gtc 
Val Gly Asp Asp He Glu Val Tyr Ser Asp Glu Ser Arg Thr Gin Val 
1010 1015 1020 



3072 



gcg cat gtg ctg tac aac ttg cgt cag cag act gag aaa ccg aaa ggg 3120 
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Ala His Val Leu Tyr Asn Leu Arg Gin Gin Thr Glu Lye Pro Lys Gly 
1025 1030 1035 1040 

gcc aac tac tgt ttg teg gac tat gtt get ccg aaa gag age, ggt aaa 
Ala' Asn Tyr Cys Leu Ser Asp Tyr Val Ala Pro Lys Glu Ser Gly Lys 
1045 1050 1055 



3166 



cgt gat tgg att ggc gcg ttt gca gta act ggt ggc att ggt gag cga 
Arg Asp Trp lie Gly Ala Phe Ala Val Thr Gly Gly lie Gly Glu Arg 
1060 1065.. 1070 



3216 



gcc ttg gcc gat get tat aaa get cag ggt gat gat tac aat gcg ate 
Ala Leu Ala Asp Ala Tyr Lys Ala Gin Gly Asp Asp Tyr Asn Ala He 
1075 1080 1085 



3264 



atg ate caa gcg gta gcc gat cgt ttg gcg gaa gcc ttt gcg gaa tat 
Met He Gin Ala Val Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu Tyr 
1090 1095 1100 



3312 



ctg cat gaa aaa gtg cgt aaa gag att tgg ggt tat gcg age gat gaa 
Leu His Glu Lys Val Arg Lys Glu He Trp Gly Tyr Ala Ser Asp Glu 
1105 1110 1115 1120 



3360 



aat etc tec aat gat gae ctg ate cgt gag cgt tat cag ggc att cga 
Asn Leu Ser Asn Asp Asp Leu He Arg Glu Arg Tyr Gin Gly He Arg 
1125 1130 1135 



3408 



ccc gcg ccg ggg tat ccc gcg tgt cct gag cat acc gag aaa, gcg act 
Pro Ala Pro Gly Tyr Pro Ala Cys Pro Glu His Thr Glu Lys Ala Thr 
1140 1145 1150 



3456 



ttg tgg cag atg eta aat gtc gaa gag acc ata ggt atg tea ctg acc 
Leu Trp Gin Met Leu Asn Val Glu Glu Thr He Gly Met Ser Leu Thr 
1155 1160 1165 



3504 



aca age tat gcg atg tgg ccg ggc get teg gta tec ggt tgg tat ttc 
Thr Ser Tyr Ala Met Trp Pro Gly Ala Ser Val Ser Gly Trp Tyr Phe 
1170 1175 HBO 



3552 



teg cat ccc gat tct cgc tat ttt gcg gta gcg cag ate caa cca gat 3600 
Ser His Pro Asp Ser Arg Tyr Phe Ala Val Ala Gin He Gin Pro Asp 
1185 1190 1195 1200 

caa ctg cac age tac get gag cgt aaa ggt tgg cgt ttg gaa gaa get 364 8 
Gin Leu His Ser Tyr Ala Glu Arg Lys Gly Trp Arg Leu Glu Glu Ala 
1205 1210 1215 



gaa aag tgg eta gcg cct aac ctt gat get taa 
Glu Lys Trp Leu Ala Pro Asn Leu Asp Ala 
1220 1225 



3681 



<210> 16 
<211> 1226 
<212> PRT 

<213> Vibrio cholerae 
<400> 16 

Val Gly Lys Glu Val Arg Gin Gin Leu Glu Gin Gin Leu Lys Gin Arg 
15 10 15 



He Leu Leu He Asp Gly Gly Met Gly Thr Met He Gin Ser Tyr Lys 
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20 



25 



30 



Leu Gin Glu Glu Asp Tyr Arg Gly Ala Arg Phe Val Asp Trp His Cys 
35 40 45 

Asp Leu Lys Gly Asn Asn Asp Leu Leu Val Leu Thr Gin Pro Gin He 
50 55 60 

He Lys Glu He His Ser Ala Tyr Leu Glu Ala Gly Ala Asp He Leu 
6 5 70 75 80 

Glu Thr Asn Thr Phe Asn Ser Thr Thr He Ala Met Ala Asp Tyr Asp 
85 90 95 

Met Gin Ser Leu Ser Ala Glu He Asn Phe Ala Ala Ala Lys Leu Ala 
100 105 no 

Arg Glu Val Ala Asp Glu Trp Thr Ala Lys Asp Pro Ser Arg Pro Arg 
115 120 125 

Tyr Val Ala Gly Val Leu Gly Pro Thr Asn Arg Thr Cys Ser He Ser 
130 135 140 

Pro Asp Val Asn Asp Pro Gly Phe Arg Asn Val Thr Phe Asp Gly Leu 
145 150 155 160 

Val Glu Ala Tyr Ser Glu Ser Thr Arg Ala Leu He Lys Gly Gly Ser 
165 170 175 

Asp Leu He Leu He Glu Thr He Phe Asp Thr Leu Asn Ala Lys Ala 
180 185 190 

Cys Ala Phe Ala Val Asp Ser Val Phe Glu Glu Leu Gly He Ser Leu 
195 200 205 

Pro Val Met He Ser Gly Thr He Thr Asp Ala Ser Gly Arg Thr Leu 
210 215 220 

Ser Gly Gin Thr Thr Glu Ala Phe Tyr Asn Ala Leu Arg His Val Arg 
225 230 235 ~ 240 

Pro He Ser Phe Gly Leu Asn Cys Ala Leu Gly Pro Asp Glu Leu Arg 
245 250 255 

Gin Tyr Val Glu Glu Leu Ser Arg He Ser Glu Cys Tyr Val Ser Ala 
260 265 270 

His Pro Asn Ala Gly Leu Pro Asn Ala Phe Gly Glu Tyr Asp Leu Ser 
275 280 285 

Ala Glu Glu Met Ala Glu His He Ala Glu Trp Ala Gin Ala Gly Phe 
290 295 300 

Leu Asn Leu Val Gly Gly Cys Cys Gly Thr Thr Pro Glu His He Ala 
305 310 315 320 

Ala He Ala Lys Ala Val Glu Gly Val Lys Pro Arg Ala Leu Pro Asp 
325 330 335 

Leu Lye Val Glu Cys Arg Leu Ser Gly Leu Glu Pro Leu Asn He Gly 



340 



345 



350 
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Pro Glu Thr Leu Phe Val Asn Val Qly Qlu Arg Thr Asn Val Thr Gly 
355 360 365 

Ser Ala Arg Phe Lys Arg Leu lie Lys Glu Glu Gin Tyr Asp Glu Ala 
370 375 380 

Leu Asp Val Ala Arg Glu Gin Val Glu Asn Gly Ala Gin lie lie Asp 
385 390 395 400 

lie Asn Met Asp Glu Gly Met Leu Asp Ala Glu Ala Cys Met Val Arg 
405 410 ' "" 415 

Phe Leu Asn Leu Cys Ala Ser Glu Pro Glu He Ser Lys Val Pro Val 
420 425 430 

Met Val Asp Ser Ser Lys Trp Glu Val He Glu Ala Gly Leu Lys Cys 
435 440 445 

He Gin Gly Lys Gly He Val Asn Ser He Ser Leu Lys Glu Gly Lys 
450 455 460 

Glu Lys Phe He Ala Gin Ala Lys Leu Val Arg Arg Tyr Gly Ala Ala 
465 470 475 480 

Val He Val Met Ala Phe Asp Glu Val Gly Gin Ala Asp Thr Arg Glu 
485 490 495 

Arg Lys Leu Glu He Cys Arg Arg Ala Tyr His He Leu Val Asp Glu 
500 505 510 

Val Gly Phe Pro Pro Glu Asp He He Phe Asp Pro Aen He Phe Ala 
515 520 525 

Val Ala Thr Gly He Asp Glu His Asn Asn Tyr Ala Leu Asp Phe He 
530 535 540 

Asn Ala Val Ala Asp He Lys Arg Glu Leu Pro His Ala Met He Ser 
54 5 550 555 560 

Gly Gly Val Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Asn Tyr Val 
565 570 " 575 

Arg Glu Ala He His Ala Val Phe Leu Tyr His Cys Phe Lys His Gly 
580 585 590 

Met Asp Met Gly He Val Asn Ala Gly Gin Leu Glu He Tyr Asp Asn 
595 600 60S 

Val Pro Leu Lys Leu Arg Glu Ala Val Glu Asp Val He Leu Asn Arg 
610 615 620 

Arg Ser Asp Gly Thr Glu Arg Leu Leu Glu He Ala Glu Ala Tyr Arg 
625 630 635 640 

Glu Asn Ser Val Gly Lys Glu Glu Asp Ala Ser Ala Leu Glu Trp Arg 
645 650 655 

Ala Trp Pro Val Ala Lys Arg Leu Glu His Ala Leu Val Lys Gly He 
660 665 670 

Thr Glu Phe He Val Gin Asp Thr Glu Glu Ala Arg Gin Gin Ala Ser 
675 680 685 
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Lys Pro Leu Glu Val He Glu Gly Pro Leu Met Asp Gly Met Ash Val 
690 695 700 

Val Gly Asp Leu Phe Gly Glu Gly Lye Met Phe Leu Pro Gin Val Val 
7 05 710 715 720 

Lys Ser Ala Arg Val Met Lys Gin Ala Val Ala Tyr Leu Glu Pro Phe 
725 730 735 

He Asn Ala Gin Lys Ser Gly Ser Thr Ser Asri Gly Lys He Leu Leu 
740 745 750 

Ala Thr Val Lys Gly Asp Val His Asp He Gly Lys Asn He Val Gly 
755 760 765 

Val Val Leu Gin Cys Asn Asn Phe Glu He He Asp Leu Gly Val Met 
770 775 780 

Val Pro Cys Glu Gin He Leu Lys Val Ala Arg Glu Gin Asn Val Asp 
785 790 795 800 

He He Gly Leu Ser Gly Leu He Thr Pro Ser Leu Asp Glu Met Val 
805 810 815 

His Val Ala Lys Glu Met Glu Arg Gin Gly Phe Glu Leu Pro Leu Leu 
820 825 830 

He Gly Gly Ala Thr Thr Ser Lys Ala His Thr Ala Val Lys He Glu 
835 840 845 

Gin Asn Tyr His Ala Pro Val Val Tyr Val Asn Asn Ala Ser Arg Ala 
850 855 860 

Val Gly Val Cys Thr Ser Leu Leu Ser Asp Glu Gin Arg Pro Gly Phe 
865 870 875 880 

He Glu Arg Leu Asp Leu Asp Tyr Glu Arg Thr Arg Asp Gin His Ala 
885 890 895 

Arg Lys Thr Pro Lys Ser Arg Pro Val Thr Leu Glu Gin Ala Arg Ala 
900 905 910 

Asn Lys Ala Ala Leu Asp Trp Ala Asn Tyr Thr Pro Pro Ala Pro Ala 
915 920 925 

Lys Pro Gly Val His Val Phe Glu Asn He Ala Leu Ala Thr Leu Arg 
930 935 940 

Pro Tyr He Asp Trp Thr Pro Phe Phe Met Thr Trp Ser Leu Met Gly 
945 950 955 960 

Lys Tyr Pro Ala He Leu Glu His Glu Glu Val Gly Glu Glu Ala Lys 
965 970 975 

Arg Leu Phe His Asp Ala Asn Ala Leu Leu Asp Lys Val Glu Arg Glu 
980 985 990 

Gly Leu Leu Lys Ala Ser Gly Met Cys Ala Leu Phe Pro Ala Ala Ser 
995 1000 1005 

Val Gly Asp Asp He Glu Val Tyr Ser Asp Glu Ser Arg Thr Gin Val 
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1010 1015 1020 

Ala His Val Leu Tyr Asn Leu Arg Gin Gin Thr Glu Lys Pro Lye Gly 
1025 1030 1035 4 1040 

Ala Asn Tyr Cys Leu Ser Asp Tyr Val Ala Pro Lys Glu Ser Gly Lys 
1045 1050 1055 

Arg Asp Trp lie Gly Ala Phe Ala Val Thr Gly Gly He Gly Glu Arg 
1060 1065 107p 

Ala Leu Ala Asp Ala Tyr Lys Ala Gin Gly Asp Asp Tyr Asn Ala He 
1075 1080 1085 

Met He Gin Ala Val Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu Tyr 
1090 1095 1100 

Leu His Glu Lys Val Arg Lys Glu He Trp Gly Tyr Ala Ser Asp Glu 
1105 1110 1H5 1120 

Asn Leu Ser Asn Asp Asp Leu He Arg Glu Arg Tyr Gin Gly He Arg 
H25 1130 1135 

Pro Ala Pro Gly Tyr Pro Ala Cys Pro Glu His Thr Glu Lys Ala Thr 
1140 1145 H50 

Leu Trp Gin Met Leu Asn Val Glu Glu Thr He Gly Met Ser Leu Thr 
1155 1160 1165 

Thr Ser Tyr Ala Met Trp Pro Gly Ala Ser Val Ser Gly Trp Tyr Phe 
1170 1175 1180 

Ser His Pro Asp Ser Arg Tyr Phe Ala Val Ala Gin He Gin Pro Asp 
H85 1190 1195 1200 

Gin Leu His Ser Tyr Ala Glu Arg Lys Gly Trp Arg Leu Glu Glu Ala 
1205 1210 1215 

Glu Lys Trp Leu Ala Pro Asn Leu Asp Ala 
1220 1225 



<210> 17 
<211> 3822 
<212> DNA 

<213> Sinorhizobiura meliloti 

<220> 
<221> CDS 
<222> (1) (3819) 
<223> RSM07338 

<400> 17 

gtg agt aaa teg ata att ctt tgt cgt ttt cag aac ggg aga tct ccc 48 

Val Ser Lys Ser He He Leu Cys Arg Phe Gin Asn Gly Arg Ser Pro 
1 5 10 15 

atg tec gec gee gac gee etc ttt gga aac gtc teg ccc aag ccg gat 96 
Met Ser Ala Ala Asp Ala Leu Phe Gly Asn Val Ser Pro Lys Pro Asp 
20 25 30 



ggt teg gaa gtc ttt egg cag etc gec cag gcg gcg get gaa cgc ate 144 
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Gly Ser Glu Val Phe Arg Gin Leu Ala Gin Ala Ala Ala Glu Arg He 
35 40 45 

etc ate atg gat ggc gec atg gga acg gag ate cag cag etc ggt ttc 192 
Leu He Met Asp Gly Ala Met Gly Thr Glu He Gin Gin Leu Gly Phe 
50 55 60 

gtg gag gat cac ttc cgc ggc gag cgc ttc ggt ggc tgc gec tgc cat 240 
Val Glu Asp His Phe Arg Gly Glu Arg Phe Gly Gly Cys Ala Cys His 
fi 5 70 75 80 

cag cag ggc aac aac gac etc ctg acg etc act cag ccg aag gcg ate 288 
Gin Gin Gly Asn Asn Asp Leu Leu Thr Leu Thr Gin Pro Lys Ala lie 
85 90 95 

gag gat att cat tac cac tac gec ate gec ggc gee gat ate etc gaa 336 
Glu Asp He His Tyr His Tyr Ala He Ala Gly Ala Asp He Leu Glu 
100 105 no 

acc aac acc ttc tec teg acg egg ate gee cag gee gat tac ggc atg 384 
Thr Asn Thr Phe Ser Ser Thr Arg He Ala Gin Ala Asp Tyr Gly Met 
115 120 125 

gag gac atg gtc tac gat etc aat cgc gac ggc gcg egg ctg gcg egg 432 
Glu Asp Met Val Tyr Asp Leu Asn Arg Asp Gly Ala Arg Leu Ala Arg 
130 135 140 

cga gec gcg aag egg gee gag gcg gag gat ggc egg egg cgc ttc gtg 480 
Arg Ala Ala Lys Arg Ala Glu Ala Glu Asp Gly Arg Arg Arg Phe Val 
145 150 155 160 

gca ggc gcg etc ggc ccc acc aac cgc acc get teg att teg ccg gac 528 
Ala Gly Ala Leu Gly Pro Thr Asn Arg Thr Ala Ser He Ser Pro Asp 
165 170 175 

gtc aac aac ccc ggc tat cga gee gtc age ttc gac gat ctg agg etc 576 
Val ABn Asn Pro Gly Tyr Arg Ala Val Ser Phe Asp Asp Leu Arg Leu 
180 185 * 190 

gee tat gee gag cag gtg egg ggc etc ate gac ggc ggt gec gac ate 624 
Ala Tyr Ala Glu Gin Val Arg Gly Leu He Asp Gly Gly Ala Asp He 
195 200 205 

ate ctg ate gag acg ate ttc gac acg ctg aat gee aag gcg gcg ate 672 
He Leu He Glu Thr He Phe Asp Thr Leu Asn Ala Lys Ala Ala He 
210 215 220 

ttc gcg acg cag gaa gtc ttt gee gaa aag ggc gtc cgc ctt ccg gtg 720 
Phe Ala Thr Gin Glu Val Phe Ala Glu Lys Gly Val Arg Leu Pro Val 
225 230 23S 240 

atg ate tec gga acg ate acc gat etc tec ggc cgt acc etc tec ggc 768 
Met He Ser Gly Thr He Thr Asp Leu Ser Gly Arg Thr Leu Ser Gly 
245 250 ~ 255 

cag acg cct acg gec ttc tgg tat teg gtg cgc cat gcg gat ccg ttt 816 
Gin Thr Pro Thr Ala Phe Trp Tyr Ser Val Arg His Ala Asp Pro Phe 
260 265 270 



acg ate ggg etc aac tgc gcg etc ggc gca aat gcg atg cgc gec cat 
Thr He Gly Leu Asn Cys Ala Leu Gly Ala Asn Ala Met Arg Ala His 
275 280 285 



B64 
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ata gac gag ctt teg gcg gtc gec gac acg etc gtc tgc gec tat ccg 9X2 
He Asp Glu Leu Ser Ala Val Ala Asp Thr Leu Val Cys Ala Tyr Pro 
290 295 300 , 

aat gee ggc ctg ccg aac gag ttc ggc cgc tat gac gaa age ccc gag 960 
Asn Ala Gly Leu Pro Asn Glu Phe Gly Arg Tyr Asp Glu Ser Pro Glu 
305 310 315 320 



cag atg gcg gcg cag gtc gag ggc ttc. gec egg gac ggt etc gtc aac 
Gin Met Ala Ala Gin Val Glu Gly Phe Ala Arg Asp Gly Leu Val Asn 
325 330 335 



1008 



1200 



ate gtc ggc ggc tgc tgc ggt tec acg ccg gee cat ate cgc gee att 1056 
He Val Gly Gly Cys Cys Gly Ser Thr Pro Ala His lie Arg Ala He 
340 345 350 

gee gaa gcg gtt gec aaa tat ccg ccg cgc egg gtg ccc gag ate gat 1104 
Ala Glu Ala Val Ala Lys Tyr Pro Pro Arg Arg Val Pro Glu He Asp 
355 360 365 

cgc cgc atg egg ctt tec ggc etc gaa ccc ttc acg ctt ace gac gag 1152 
Arg Arg Met Arg Leu Ser Gly Leu Glu Pro Phe Thr Leu Thr Asp Glu 
370 375 380 

att ccc ttc gtc aac gtc ggc gaa cgc acc aac gtc ace ggc teg gcg 
He Pro Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser Ala 
385 390 395 400 

aag ttc cgc aag ctg ate acc gee ggg gac tac gec gec gca etc gat 1248 
LyB Phe Arg Lys Leu He Thr Ala Gly Asp Tyr Ala Ala Ala Leu Asp 
405 410 415 

gtg gcg cgt gat cag gtg gcg aat ggc gec cag ate ate gac gtc aac 1296 
Val Ala Arg Asp Gin Val Ala Asn Gly Ala Gin lie lie Asp Val Asn 
420 425 430 

atg gac gaa ggc ctg ate gat teg aag cag gtg atg gtc gag ttc ctg 1344 
Met Asp Glu Gly Leu He Asp Ser Lys Gin Val Met Val Glu Phe Leu 
435 440 445 

aac etc gtc gee tec gag ccg gat ate gec cgt gta ccg gtg atg ate 1392 
Asn Leu Val Ala Ser Glu Pro Asp He Ala Arg Val Pro Val Met lie 
450 455 460 

gat teg teg aaa tgg gag gtg ate gaa gec ggg etc aaa tgc gtc cag 1440 
Asp Ser Ser LyB Trp Glu Val He Glu Ala Gly Leu Lys Cys Val Gin 
465 470 475 480 

ggc aag gcg ctg gtg aac tec ate teg etc aag gaa ggc gag gcg get 1488 
Gly Lys Ala Leu Val Asn Ser He Ser Leu Lys Glu Gly Glu Ala Ala 
485 490 495 

ttc ctg cac cat gcg cgc etc gtg cgc gec tat ggc gee gcg gtc gtg 1536 
Phe Leu His His Ala Arg Leu Val Arg Ala Tyr Gly Ala Ala Val Val 
500 505 510 

gtg atg gcg ttc gac gag aag ggc cag gec gac acg aaa acc cgc aag 1584 
Val Met Ala Phe Asp. Glu Lys Gly Gin Ala Asp Thr Lys Thr Arg Lys 
515 520 525 

gtg gaa ate tgc egg egg gec tat egg ctg ctg acg gaa gag gtt ggc 1632 
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Val Glu He Cys Arg Arg Ala Tyr Arg Leu Leu Thr Glu Glu Val Gly 
530 535 540 

ttc ccc ccg gag gac ate ate ttc gac ccg aat ate ttc gcg gtc gcg 1680 
Phe Pro Pro Glu Asp He He Phe Asp Pro Asn He Phe Ala Val Ala 
545 550 555 560 

ace ggc ate gag gag cac aac aat tac ggc gtc gac ttc ate gag gcg 1728 
Thr Gly He Glu Glu His Asn Asn Tyr Gly Val Asp Phe He Glu Ala 
565 570 575 

acg cac gag ate ate gcg gca ctg ccg cat gtc cac gtc tec ggc ggc 1776 
Thr His Glu He He Ala Ala Leu Pro His Val His Val Ser Gly Gly 
580 585 590 

gtg teg aac etc tec ttt tec ttc cgc ggc aac gag ccg gtg cgc gag 1824 
Val Ser Asn Leu Ser Phe Ser Phe Arg Gly Asn Glu Pro Val Arg Glu 
595 600 605 

gcg atg cac gee ate ttc ctt tat cac gcg ate cag gee ggc atg gac 1872 
Ala Met His Ala He Phe Leu Tyr His Ala He Gin Ala Gly Met Asp 
610 615 620 

atg ggc ate gtc aat gee gga cag etc gee gtc tat gat gcg ate gac 1920 
Met Gly He Val Asn Ala Gly Gin Leu Ala Val Tyr Asp Ala He Asp 
625 630 635 640 

ccg gaa ctg cgc gaa ace tgc gag gac gtg gtg etc aac cgc egg gee 1968 
Pro Glu Leu Arg Glu Thr Cys Glu Asp Val Val Leu Asn Arg Arg Ala 
645 650 655 

gat teg ace gag cgc etc ctg gag ate gee gag cgc tat cgc ggg aag 2016 
Abp Ser Thr Glu Arg Leu Leu Glu He Ala Glu Arg Tyr Arg Gly Lys 
660 665 670 

ggc ggg age cag ggc aag gag aag gac ctt gee tgg cgc gaa tgg ccg 2064 
Gly Gly Ser Gin Gly Lys Glu Lys Asp Leu Ala Trp Arg Glu Trp Pro 
675 680 685 

gtg gag aag egg etc gaa cac gcg etc gtc aat gga att ace gaa ttt 2112 
Val Glu Lys Arg Leu Glu His Ala Leu Val Asn Gly He Thr Glu Phe 
690 695 700 

ate gaa gee gat acg gaa gag gee egg ctt gec gee gag egg ccg ctg 2160 
He Glu Ala Asp Thr Glu Glu Ala Arg Leu Ala Ala Glu Arg Pro Leu 
705 710 715 720 

cat gtc ate gaa ggc ccg ctg atg gee ggg atg aac gtc gtg ggc gat 2208 
His Val He Glu Gly Pro Leu Met Ala Gly Met Asn Val Val Gly Asp 
725 730 735 

etc ttc ggt tec ggc aag atg ttc ctg ccg cag gtg gtc aag tec gee 2256 
Leu Phe Gly Ser Gly Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala 
740 74S 750 

egg gtg atg aag cag gee gtt gcg gtg ctg etc ccc cat atg gag gag 2304 
Arg Val Met Lys Gin Ala Val Ala Val Leu Leu Pro His Met Glu Glu 
755 760 765 

gag aag cgc gee aat ggc ggc ggc gag gcg cgc gag agt gee ggc aag 2352 
Glu Lys Arg Ala Asn Gly Gly Gly Glu Ala Arg Glu Ser Ala Gly Lys 
770 775 780 
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ate ctg atg gcg acc gtc aag ggc gac gtg cac gac ate ggc aag aac 2400 
He Leu Met Ala Thr Val Lys Gly Asp Val His Asp He Gly Lye Asn 
785 790 795 4 800 

ate gtc ggc gtc gtg etc gec tgc aac aat tac gag ate ate gac etc 2448 
He Val Gly Val Val Leu Ala Cys Asn Asn Tyr Glu He He Asp Leu 
805 810 815 

ggc gtc atg gtg ccc teg get aag ate etc gaa gtg gcg cgc gaa cag 2496 
Gly Val Met Val Pro Ser Ala Lys He Leu Glu Val Ala Afg Glu Gin 
820 825 830 

aag gtc gac ate gtc ggt ctt tec ggc etc ate acg ccg teg ctg gac 2544 
Lys Val ABp He Val Gly Leu Ser Gly Leu He Thr Pro Ser Leu Asp 
835 840 845 

gag atg gcg cat gtc get tec gag etc gaa egg gag ggc ttc gat gtc 2592 
Glu Met Ala His Val Ala Ser Glu Leu Glu Arg Glu Gly Phe Asp Val 
850 855 860 

ccg ctg ctg ate ggc ggg gcg acg acc age cgc gtg cac acg gee gtg 2640 
Pro Leu Leu He Gly Gly Ala Thr Thr Ser Arg Val His Thr Ala Val 
8«5 870 875 880 

aag ate aat ccg cgt tac age etc ggc cag acg gtc tat gtc acc gac 2688 
Lys He Asn Pro Arg Tyr Ser Leu Gly Gin Thr Val Tyr Val Thr Asp 
885 890 ** 895 

gee age cgc gcg gtc ggc gtc gta teg age ctg etc teg ccg gaa gtc 2736 
Ala Ser Arg Ala Val Gly Val Val Ser Ser Leu Leu Ser Pro Glu Val 
900 905 910 

cgc gac tec tac aag aaa acg gtc cgc gcg gag tat ctg aag gtt gee 2784 
Arg Asp Ser Tyr Lys Lys Thr Val Arg Ala Glu Tyr Leu Lys Val Ala 
915 920 925 

gac gca cat gec cgc aac gaa gee gag aag cgc cgt ctg ccg ctt tec 2832 
Asp Ala His Ala Arg Asn Glu Ala Glu Lys Arg Arg Leu Pro Leu Ser 
930 935 940 

cag gcg egg gcg aat gec ttt egg ata gat tgg gac gee cac cag ccg 2880 
Gin Ala Arg Ala Asn Ala Phe Arg He Asp Trp Asp Ala His Gin Pro 
945 950 955 960 

aag gtt ccg tec ttc etc ggc acg cgt gtt ttc gag gga tgg gac etc 2928 
Lys Val Pro Ser Phe Leu Gly Thr Arg Val Phe Glu Gly Trp Asp Leu 
965 970 975 

gee gaa etc gee cgc tat ate gac tgg acg ccg ttc ttc cag acc tgg 2976 
Ala Glu Leu Ala Arg Tyr He Asp Trp Thr Pro Phe Phe Gin Thr Trp 
980 985 990 

gag ctg aag ggg gta ttc ccg aaa ate etc gat gac gaa cgc cag ggg 3024 
Glu Leu Lys Gly Val Phe Pro Lys He Leu Asp Asp Glu Arg Gin Gly 
995 1000 1005 



get gec get cgc cag etc ttc gag gat gcg cag gcg atg gtc gaa aag 
Ala Ala Ala Arg Gin Leu Phe Glu Asp Ala Gin Ala Met Val Glu Lys 
1010 1015 1020 



3072 



ate gtg gee gag gca tgg ttc gec ccg aag gee gtg ate ggc ttc tgg 3120 
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He Val Ala Glu Ala Trp phe Ala Pro Lye Ala Val He Gly Phe Trp 
1025 1030 103S 1040 



ccg gcc gcc age atg ggc gac gac gtc cgc ctg ttt gec gac gag gtg 
Pro Ala Ala Ser Met Gly Asp Asp Val Arg Leu Phe Ala Abp Glu Val 
1045 1050 1055 



3168 



cgc gaa gcc gag ctt gcc acc ttc ttc acg etc cgc cag cag atg gtg 
Arg Glu Ala Glu Leu Ala Thr Phe Phe Thr Leu Arg Gin Gin Met Val 
1060 1065 ~ 1070 



3216 



aag cgc gac ggc egg ccg aac gtc gcc ctt gcc gac ttc gtc gcc ccg 
Lys Arg Asp Gly Arg Pro Asn Val Ala Leu Ala ABp Phe Val Ala Pro 
1075 1080 1085 



3264 



gcg gcg age ggc aag egg gac tat gtc ggc ggt ttc gtg gtg acg gcc 
Ala Ala Ser Gly Lys Arg Asp Tyr Val Gly Gly Phe Val Val Thr Ala 
1090 1095 1100 



3312 



ggc ate gag gaa gtg gcg ate gcc gaa cgc ttc gaa egg gcg aac gac 
Gly He Glu Glu Val Ala He Ala Glu Arg Phe Glu Arg Ala Asn Asp 
H05 1110 1115 1120 



3360 



gat tat tec teg ate atg gtc aag gcg ctt gcg gac cgc ttc gca gag 
Asp Tyr Ser Ser He Met Val Lys Ala Leu Ala Asp Arg Phe Ala Glu 
1125 1130 1135 



3408 



gcc ttt gcc gag cgc atg cat gaa tat gtc cgc aag gag etc tgg ggc 
Ala Phe Ala Glu Arg Met His Glu Tyr Val Arg Lys Glu Leu Trp Gly 
1140 1145 1150 



3456 



tat get ccg gac gaa gcc ttc acg ccg cag gaa ttg ate gcc gag ccc 
Tyr Ala Pro Asp Glu Ala Phe Thr Pro Gin Glu Leu He Ala Glu Pro 
1155 1160 1165 



3504 



tat gcc ggc ate cgc cct gcg ccc ggc tac ccg gcg cag ccc gac cac 
Tyr Ala Gly He Arg Pro Ala Pro Gly Tyr Pro Ala Gin Pro Asp His 
1170 1175 1180 



3552 



acg gaa aag gag acg ctt ttc egg etc ctg gat gcg gaa gcc get ate 
Thr Glu Lys Glu Thr Leu Phe Arg Leu Leu Asp Ala Glu Ala Ala He 
H85 1190 1195 1200 



3600 



ggc gtc egg etc acc gag age tat gcg atg tgg ccg ggc tct teg gta 
Gly Val Arg Leu Thr Glu Ser Tyr Ala Met Trp Pro Gly Ser Ser Val 
1205 1210 1215 



3648 



teg ggc etc tat gtc ggc cac ccc gat tec tat tac ttc ggc gtc gca 
Ser Gly Leu Tyr Val Gly His Pro Asp Ser Tyr Tyr Phe Gly Val Ala 
1220 1225 1230 



3696 



aag ate gag cgc gat cag gtg gag gac tat gcc gat cgc aag cgc atg 
Lys He Glu Arg Asp Gin Val Glu Asp Tyr Ala Asp Arg Lys Arg Met 
1235 1240 1245 



3744 



age gtc cgc gag gtc gag cgc tgg ctt teg ccg ate etc aat tac gtg 
Ser Val Arg Glu Val Glu Arg Trp Leu Ser Pro He Leu Asn Tyr Val 
1250 1255 1260 



3792 



ccg atg ccg gag acg gaa gcg gcg gag tag 
Pro Met Pro Glu Thr Glu Ala Ala Glu 
1265 1270 



3822 
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<210> 18 
<211> 1273 
<212> PRT 

<213> Sinorhizobium meliloti 
<400> 18 

Val Ser Lye Ser He He Leu Cys Arg Phe Gin Asn Gly Arg Ser Pro 
1 5 10 15 

Met Ser Ala Ala Asp Ala Leu Phe Gly Asn Val Ser Pro Lys Pro Asp 
20 25 30 

Gly Ser Glu Val Phe Arg Gin Leu Ala Gin Ala Ala Ala Glu Arg He 
35 40 45 

Leu He Met Asp Gly Ala Met Gly Thr Glu He Gin Gin Leu Gly Phe 
50 55 60 

Val Glu Asp His Phe Arg Gly Glu Arg Phe Gly Gly Cys Ala Cys His 
6S 70 75 B0 

Gin Gin Gly Asn Asn Asp Leu Leu Thr Leu Thr Gin Pro Lys Ala He 
85 90 95 

Glu Asp He His Tyr His Tyr Ala He Ala Gly Ala Asp He Leu Glu 
100 105 nq 

Thr Asn Thr Phe Ser Ser Thr Arg He Ala Gin Ala Asp Tyr Gly Met 
115 120 125 

Glu Asp Met Val Tyr Asp Leu Asn Arg Asp Gly Ala Arg Leu Ala Arg 
130 135 140 

Arg Ala Ala Lys Arg Ala Glu Ala Glu Asp Gly Arg Arg Arg Phe Val 
145 150 155 160 

Ala Gly Ala Leu Gly Pro Thr Asn Arg Thr Ala Ser He Ser Pro Asp 
165 170 175 

Val Asn Asn Pro Gly Tyr Arg Ala Val Ser Phe Asp Asp Leu Arg Leu 
180 185 190 

Ala Tyr Ala Glu Gin Val Arg Gly Leu He Asp Gly Gly Ala Asp He 
195 200 205 

lie Leu He Glu Thr He Phe Asp Thr Leu Asn Ala Lys Ala Ala He 
210 215 220 

Phe Ala Thr Gin Glu Val Phe Ala Glu Lys Gly Val Arg Leu Pro Val 
225 230 235 240 

Met He Ser Gly Thr He Thr Asp Leu Ser Gly Arg Thr Leu Ser Gly 
245 250 255 

Gin Thr Pro Thr Ala Phe Trp Tyr Ser Val Arg His Ala Asp Pro Phe 
260 265 270 

Thr He Gly Leu Asn Cys Ala Leu Gly Ala Asn Ala Met Arg Ala His 
275 280 285 
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He Asp Glu Leu Ser Ala Val Ala Asp Thr Leu Val Cys Ala Tyr Pro 
290 295 300 

Asn Ala Gly Leu Pro Asn Glu Phe Gly Arg Tyr Asp Glu Ser Pro Glu 
305 310 315 320 

Gin Met Ala Ala Gin Val Glu Gly Phe Ala Arg Asp Gly Leu Val Asn 
325 330 ~ 335 

He Val Gly Gly Cys Cys Gly Ser Thr Pro Ala His He Arg Ala He 
340 345 350 

Ala Glu Ala Val Ala Lys Tyr Pro Pro Arg Arg Val Pro Glu lie Asp 
355 360 " 365 

Arg Arg Met Arg Leu Ser Gly Leu Glu Pro Phe Thr Leu Thr Asp Glu 
370 375 380 

He Pro Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser Ala 
385 390 395 400 

Lys Phe Arg Lys Leu He Thr Ala Gly Asp Tyr Ala Ala Ala Leu Asp 
405 410 415 

Val Ala Arg Asp Gin Val Ala Asn Gly Ala Oln He He Asp Val Asn 
420 425 430 

Met Asp Glu Gly Leu He Asp Ser Lys Gin Val Met Val Glu Phe Leu 
435 440 445 

Asn Leu Val Ala Ser Glu Pro Asp He Ala Arg Val Pro Val Met He 
450 455 460 

Asp Ser Ser Lys Trp Glu Val He Glu Ala Gly Leu Lys Cye Val Gin 
465 470 475 480 

Gly Lys Ala Leu Val Asn Ser He Ser Leu Lys Glu Gly Glu Ala Ala 
485 490 495 

Phe Leu His His Ala Arg Leu Val Arg Ala Tyr Gly Ala Ala Val Val 
500 505 510 

Val Met Ala Phe Asp Glu Lys Gly Gin Ala Asp Thr Lys Thr Arg Lys 
515 520 525 

Val Glu He Cys Arg Arg Ala Tyr Arg Leu Leu Thr Glu Glu Val Gly 
530 535 540 

Phe Pro Pro Glu Asp He He Phe Asp Pro Asn He Phe Ala Val Ala 
545 550 555 560 

Thr Gly He Glu Glu His Asn Asn Tyr Gly Val Asp Phe He Glu Ala 
565 570 575 

Thr His Glu He He Ala Ala Leu Pro His Val His Val Ser Gly Gly 
580 585 590 

Val Ser Asn Leu Ser Phe Ser Phe Arg Gly Asn Glu Pro Val Arg Glu 
595 600 * 605 

Ala Met His Ala He Phe Leu Tyr His Ala He Gin Ala Gly Met Asp 
610 615 620 
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Met Gly He Val Aen Ala Gly Gin Leu Ala Val Tyr Asp Ala He Asp 
625 630 635 640 

Pro Glu Leu Arg Glu Thr Cys Glu Asp Val Val Leu Asn Arg Arg Ala 
645 650 6,55 

Asp Ser Thr Glu Arg Leu Leu Glu He Ala Glu Arg Tyr Arg Gly Lys 
660 665 670 

Gly Gly Ser Gin Gly Lys Glu Lys Asp Leu Ala Trp Arg Glu Trp Pro 
675 660 685 

Val Glu Lys Arg Leu Glu His Ala Leu Val Aen Gly He Thr Glu Phe 
690 695 700 

He Glu Ala Asp Thr Glu Glu Ala Arg Leu Ala Ala Glu Arg Pro Leu 
705 710 715 720 

His Val He Glu Gly Pro Leu Met Ala Gly Met Asn Val Val Gly Asp 
725 730 735 

Leu Phe Gly Ser Gly Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala 
740 745 750 

Arg Val Met Lys Gin Ala Val Ala Val Leu Leu Pro His Met Glu Glu 
755 760 765 

Glu Lys Arg Ala Asn Gly Gly Gly Glu Ala Arg Glu Ser Ala Gly Lys 
770 775 780 

He Leu Met Ala Thr Val Lys Gly Asp Val His Asp He Gly Lys Asn 
785 790 795 800 

He Val Gly Val Val Leu Ala Cys Asn Asn Tyr Glu He He Asp Leu 
805 810 815 

Gly Val Met Val Pro Ser Ala Lys lie Leu Glu Val Ala Arg Glu Gin 
820 825 830 

Lys Val Asp He Val Gly Leu Ser Gly Leu He Thr Pro Ser Leu Asp 
835 840 845 

Glu Met Ala His Val Ala Ser Glu Leu Glu Arg Glu Gly Phe Asp Val 
850 855 860 

Pro Leu Leu He Gly Gly Ala Thr Thr Ser Arg Val His Thr Ala Val 
865 870 875 880 

Lys He Asn Pro Arg Tyr Ser Leu Gly Gin Thr Val Tyr Val Thr Asp 
885 890 895 

Ala Ser Arg Ala Val Gly Val Val Ser Ser Leu Leu Ser Pro Glu Val 
900 905 910 

Arg Asp Ser Tyr Lys Lys Thr Val Arg Ala Glu Tyr Leu Lys Val Ala 
915 920 925 

Asp Ala His Ala Arg Asn Glu Ala Glu Lys Arg Arg Leu Pro Leu Ser 
930 935 940 

Gin Ala Arg Ala Asn Ala Phe Arg He Asp Trp Asp Ala His Gin Pro 
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945 950 955 960 

Lys Val Pro Ser Phe Leu Gly Thr Arg Val Phe Glu Gly Trp Asp Leu 
965 970 975 

Ala Glu Leu Ala Arg Tyr lie Asp Trp Thr Pro Phe Phe Gin Thr Trp 
980 985 990 

Glu Leu Lys Gly Val Phe Pro Lys He Leu Asp Asp Glu Arg Gin Gly 
995 1000 1005 

Ala Ala Ala Arg Gin Leu Phe Glu Asp Ala Gin Ala Met Val Glu Lys 
1010 1015 1020 

He Val Ala Glu Ala Trp Phe Ala Pro Lys Ala Val He Gly Phe Trp 
1 025 1030 1035 1040 

Pro Ala Ala Ser Met Gly Asp Asp Val Arg Leu Phe Ala Asp Glu Val 
1045 1050 1055 

Arg Glu Ala Glu Leu Ala Thr Phe Phe Thr Leu Arg Gin Gin Met Val 
1060 1065 " 1070 

Lye Arg Asp Gly Arg Pro Asn Val Ala Leu Ala Asp Phe Val Ala Pro 
1075 1080 1085 

Ala Ala Ser Gly Lys Arg Asp Tyr Val Gly Gly Phe Val Val Thr Ala 
1090 1095 1100 

Gly He Glu Glu Val Ala He Ala Glu Arg Phe Glu Arg Ala Asn Asp 
1105 mo 1U5 U20 

Asp Tyr Ser Ser He Met Val Lys Ala Leu Ala Asp Arg Phe Ala Glu 
1125 H30 H35 

Ala Phe Ala Glu Arg Met His Glu Tyr Val Arg Lys Glu Leu Trp Gly 
1140 H45 * H50 

Tyr Ala Pro Asp Glu Ala Phe Thr Pro Gin Glu Leu He Ala Glu Pro 
1155 H60 H65 

Tyr Ala Gly He Arg Pro Ala Pro Gly Tyr Pro Ala Gin Pro Asp His 
1170 H75 HBO 

Thr Glu Lys Glu Thr Leu Phe Arg Leu Leu Asp Ala Glu Ala Ala He 
H90 H95 1200 

Gly Val Arg Leu Thr Glu Ser Tyr Ala Met Trp Pro Gly Ser Ser Val 
1205 1210 1215 

Ser Gly Leu Tyr Val Gly His Pro Asp Ser Tyr Tyr Phe Gly Val Ala 
1220 1225 1230 

Lys He Glu Arg Asp Gin Val Glu Asp Tyr Ala Asp Arg Lye Arg Met 
1235 1240 124S 

Ser Val Arg Glu Val Glu Arg Trp Leu Ser Pro He Leu Asn Tyr Val 
1250 1255 1260 

Pro Met Pro Glu Thr Glu Ala Ala Glu 
1265 1270 
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<210> 19 
<211> 3684 
<212> DNA 

<213> Escherichia coli 

<220> 
<221> CDS 
<222> (1) . . (3681) 
<223> RBC03905 

<400> 19 

gtg age age aaa gtg gaa caa ctg cgt gcg cag tta aat gaa cgt att 48 

Val Ser Ser Lys Val Glu Gin Leu Arg Ala Gin Leu Asn Glu Arg He 

1 5 10 15 

ctg gtg ctg gac ggc ggt atg ggc acc atg ate cag agt tat cga ctg 96 
Leu Val Leu Asp Gly Gly Met Gly Thr Met He Gin Ser Tyr Arg Leu 
20 25 30 

aac gaa gec gat ttt cgt ggt gaa cgc ttt gee gac tgg cca tgc gac 144 
Asn Glu Ala Asp Phe Arg Gly Glu Arg Phe Ala Asp Trp Pro Cys Asp 
35 40 45 

etc aaa ggc aac aac gac ctg ctg gta etc agt aaa ccg gaa gtg ate 192 
Leu Lys Gly Asn Asn Asp Leu Leu Val Leu Ser Lys Pro Glu Val He 
50 55 60 

gee get ate cac aac gee tac ttt gaa gcg ggc gcg gat ate ate gaa 240 
Ala Ala He His Asn Ala Tyr Phe Glu Ala Gly Ala Asp He He Glu 
65 70 75 80 

acc aac acc ttc aac tec acg acc att gcg atg gcg gat tac cag atg 288 
Thr Asn Thr Phe Asn Ser Thr Thr He Ala Met Ala Asp Tyr Gin Met 
85 90 95 

gaa tec ctg teg gcg gaa ate aac ttt gcg gcg gcg aaa ctg gcg cga 336 
Glu Ser Leu Ser Ala Glu He Asn Phe Ala Ala Ala Lys Leu Ala Arg 
100 105 HO 

get tgt get gac gag tgg acc gcg cgc acg cca gag aaa ccg cgc tac 384 
Ala Cys Ala Asp Glu Trp Thr Ala Arg Thr Pro Glu Lys Pro Arg Tyr 
115 120 125 

gtt gee ggt gtt etc ggc ccg acc aac cgc acg gcg tct att tct ccg 432 
Val Ala Gly Val Leu Gly Pro Thr Asn Arg Thr Ala Ser He Ser Pro 
130 135 140 

gac gtc aac gat ccg gca ttt cgt aat ate act ttt gac ggg ctg gtg 480 
Asp Val Asn Asp Pro Ala Phe Arg Asn He Thr Phe Asp Gly Leu Val 
145 150 155 * 160 

gcg get tat cga gag tec acc aaa gcg ctg gtg gaa ggt ggc gcg gat 528 
Ala Ala Tyr Arg Glu Ser Thr Lys Ala Leu Val Glu Gly Gly Ala Asp 
165 170 175 

ctg ate ctg att gaa acc gtt ttc gac acc ctt aac gec aaa gcg gcg 576 
Leu He Leu He Glu Thr Val Phe Asp Thr Leu Asn Ala Lys Ala Ala 
180 185 190 

gta ttt gcg gtg aaa acg gag ttt gaa gcg ctg ggc gtt gag ctg ccg 624 
Val Phe Ala Val Lys Thr Glu Phe Glu Ala Leu Gly Val Glu Leu Pro 
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195 200 205 

att atg ate tec ggc acc ate acc gac gee tec ggg cgc acg etc tec 672 
lie Met He Ser Gly Thr He Thr Asp Ala Ser Gly Arg Thr Leu Ser 
210 215 220 

ggg cag acc acc gaa gca ttt tac aac tea ttg cgc cac gee gaa get 720 
Gly Gin Thr Thr Glu Ala Phe Tyr Asn Ser Leu Arg His Ala Glu Ala 
225 230 235 240 

ctg acc ttt ggc ctg aac tgt gcg ctg ggg ccc gat gaa ctg cgc cag 768 
Leu Thr Phe Gly Leu Asn Cys Ala Leu Gly Pro Asp Glu Leu Arg Gin 
245 250 255 

tac gtg cag gag ctg tea egg att gcg gaa tgc tac gtc acc gcg cac 816 
Tyr Val Gin Glu Leu Ser Arg He Ala Glu Cys Tyr Val Thr Ala His 
260 265 270 

ccg aac gee ggg eta ccc aac gec ttt ggt gag tac gat etc gac gee 864 
Pro Asn Ala Gly Leu Pro Asn Ala Phe Gly Glu Tyr Asp Leu Asp Ala 
275 280 285 

gac acg atg gca aaa cag ata cgt gaa tgg gcg caa gcg ggt ttt etc 912 
Asp Thr Met Ala Lys Gin He Arg Glu Trp Ala Gin Ala Gly Phe Leu 
290 295 300 

aat ate gtc ggc ggc tgc tgt ggc acc acg cca caa cat att gca gcg 960 
Asn He Val Gly Gly Cys Cys Gly Thr Thr Pro Gin His He Ala Ala 
305 310 315 320 

atg agt cgt gca gta gaa gga tta gcg ccg cgc aaa ctg ccg gaa att 1008 
Met Ser Arg Ala Val Glu Gly Leu Ala Pro Arg Lys Leu Pro Glu He 
325 330 ~ 335 

ccc gta gee tgc cgt ttg tec ggc ctg gag ccg ctg aac att ggc gaa 1056 
Pro Val Ala Cys Arg Leu Ser Gly Leu Glu Pro Leu Asn He Gly Glu 
340 345 350 

gat age ctg ttt gtg aac gtg ggt gaa cgc acc aac gtc acc ggt tec 1104 
Asp Ser Leu Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser 
355 360 365 

get aag ttc aag cgc ctg ate aaa gaa gag aaa tac age gag gcg ctg 1152 
Ala Lys Phe Lys Arg Leu He Lys Glu Glu Lys Tyr Ser Glu Ala Leu 
370 375 380 



gat gtc gcg cgt caa cag gtg gaa aac ggc gcg cag att ate gat ate 
Asp Val Ala Arg Gin Gin Val Glu Asn Gly Ala Gin He He Asp He 
385 390 395 400 



1200 



aac atg gat gaa ggg atg etc gat gee gaa gcg gcg atg gtg cgt ttt 1248 
Asn Met Asp Glu Gly Met Leu Asp Ala Glu Ala Ala Met Val Arg Phe 
405 410 415 

etc aat ctg att gee ggt gaa ccg gat ate get cgc gtg ccg att atg 1296 
Leu Asn Leu He Ala Gly Glu Pro Asp He Ala Arg Val Pro He Met 
420 425 430 

ate gac tec tea aaa tgg gac gtc att gaa aaa ggt ctg aag tgt ate 1344 
He Asp Ser Ser Lys Trp Asp Val He Glu Lys Gly Leu Lys Cys He 
435 440 445 
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cag ggc aaa ggc att gtt aac tct ate teg atg aaa gag ggc gtc gat 1392 
Gin Gly Lys Gly lie Val Asn Ser He Ser Met Lye Glu Gly. Val Asp 
450 455 460 

gec ttt ate cat cac gcg aaa ttg ttg cgt cgc tac ggt gcg gca gtg 1440 
Ala Phe He Hie His Ala Lys Leu Leu Arg Arg Tyr Gly Ala Ala Val 
465 470 475 * 480 

gtg gta atg gee ttt gac gaa cag gga cag gee gat act cgc gca egg 1488 
Val Val Met Ala Phe Asp Glu Gin Gly Gin Ala Asp Thr Arg Ala Arg 
485 490 495 

aaa ate gag att tgc cgt egg gcg tac aaa ate etc ace gaa gag gtt 1536 
Lys He Glu He Cys Arg Arg Ala Tyr Lys He Leu Thr Glu Glu Val 
500 505 510 

ggc ttc ccg cca gaa gat ate ate ttc gac cca aac ate ttc gcg gtc 1584 
Gly Phe Pro Pro Glu Asp He He Phe Asp Pro Asn He Phe Ala Val 
515 520 525 

gca act ggc att gaa gag cac aac aac tac gcg cag gac ttt ate ggc 1632 
Ala Thr Gly He Glu Glu His Asn Asn Tyr Ala Gin Asp Phe He Gly 
530 535 540 

gcg tgt gaa gac ate aaa cgc gaa ctg ccg cac gcg ctg att tee ggc 1680 
Ala Cys Glu Asp He Lys Arg Glu Leu Pro His Ala Leu He Ser Gly 
545 550 555 560 

ggc gta tct aac gtt tct ttc teg ttc cgt ggc aac gat ccg gtg cgc 1728 
Gly Val Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Asp Pro Val Arg 
565 570 575 

gaa gee att cac gca gtg ttc etc tac tac get att cgc aat ggc atg 1776 
Glu Ala He His Ala Val Phe Leu Tyr Tyr Ala He Arg Asn Gly Met 
580 585 590 

gat atg ggg ate gtc aac gee ggg caa ctg gcg att tac gac gac eta 1824 
Asp Met Gly He Val Asn Ala Gly Gin Leu Ala He Tyr Asp Asp Leu 
595 600 605 

ccc get gaa ctg cgc gac gcg gtg gaa gat gtg att ctt aat cgt cgc 1872 
Pro Ala Glu Leu Arg Asp Ala Val Glu Asp Val He Leu Asn Arg Arg 
610 615 620 

gac gat ggc ace gag cgt tta ctg gag ctt gee gag aaa tat cgc ggc 1920 
Asp Asp Gly Thr Glu Arg Leu Leu Glu Leu Ala Glu Lys Tyr Arg Gly 
625 630 635 ' 640 

age aaa ace gac gac ace gee aac gee cag cag gcg gag tgg cgc teg 1968 
Ser Lys Thr Asp Asp Thr Ala Asn Ala Gin Gin Ala Glu Trp Arg Ser 
645 650 655 

tgg gaa gtg aat aaa cgt ctg gaa tac teg ctg gtc aaa ggc att ace 2016 
Trp Glu Val Asn Lys Arg Leu Glu Tyr Ser Leu Val Lys Gly He Thr 
660 665 670 

gag ttt ate gag cag gat ace gaa gaa gee cgc cag cag get acg cgc 2064 
Glu Phe He Glu Gin Asp Thr Glu Glu Ala Arg Gin Gin Ala Thr Arg 
675 680 " 685 

ccg att gaa gtg att gaa ggc ccg ttg atg gac ggc atg aat gtg gtc 2112 
Pro He Glu Val He Glu Gly Pro Leu Met Asp Gly Met Asn Val Val 



I 
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690 695 700 

ggc gac ctg ttt ggc gaa ggg aaa atg ttc ctg cca cag gtg gtc aaa 2160 
Gly Asp Leu Phe Gly Glu Gly Lys Met Phe Leu Pro Gin Val Val Lye 
7 <>5 710 715 720 

teg gcg cgc gtc atg aaa cag gcg gtg gec tac etc gaa ccg ttt att 2208 
Ser Ala Arg Val Met Lys Gin Ala Val Ala Tyr Leu Glu Pro Phe He 
725 730 735 

gaa gec age aaa gag cag ggc aaa ace aac ggc aag atg gtg ate gec 2256 
Glu Ala Ser Lys Glu Gin Gly Lys Thr Asn Gly Lys Met Val He Ala 
740 745 750 

acc gtg aag ggc gac gtc cac gac ate ggt aaa aat ate gtt ggt gtg 2304 
Thr Val Lys Gly Asp Val His Asp He Gly Lys Asn He Val Gly Val 
755 760 765 

gtg ctg caa tgt aac aac tac gaa att gtc gat etc ggc gtt atg gtg 2352 
Val Leu Gin Cys Asn Asn Tyr Glu He Val Asp Leu Gly Val Met Val 
770 775 780 

cct gcg gaa aaa att etc cgt acc get aaa gaa gtg aat get gat ctg 2400 
Pro Ala Glu Lys He Leu Arg Thr Ala Lys Glu Val Asn Ala Asp Leu 
785 790 795 800 

att ggc ctt teg ggg ctt ate acg ccg teg ctg gac gag atg gtt aac 2448 
He Gly Leu Ser Gly Leu He Thr Pro Ser Leu Asp Glu Met Val Asn 
805 810 815 

gtg gcg aaa gag atg gag cgt cag ggc ttc act att ccg tta ctg att 2496 
val Ala Lys Glu Met Glu Arg Gin Gly Phe Thr He Pro Leu Leu He 
820 825 830 

ggc ggc gcg acg acc tea aaa gcg cac acg gcg gtg aaa ate gag cag 2544 
Gly Gly Ala Thr Thr Ser Lys Ala His Thr Ala Val Lys He Glu Gin 
835 840 845 

aac tac age ggc ccg acg gtg tat gtg cag aat gec teg cgt acc gtt 2592 
Asn Tyr Ser Gly Pro Thr Val Tyr Val Gin Asn Ala Ser Arg Thr Val 
850 855 860 

99t gtg gtg gcg gcg ctg ctt tec gat acc cag cgt gat gat ttt gtc 2640 
Gly Val Val Ala Ala Leu Leu Ser Asp Thr Gin Arg Asp Asp Phe Val 
865 870 875 860 

get cgt acc cgc aag gag tac gaa acc gta cgt att cag cac ggg cgc 2688 
Ala Arg Thr Arg Lys Glu Tyr Glu Thr Val Arg He Gin His Gly Arg 
885 890 895 

aag aaa ccg cgc aca cca ccg gtc acg ctg gaa gcg gcg cgc gat aac 2736 
Lys Lys Pro Arg Thr Pro Pro Val Thr Leu Glu Ala Ala Arg Asp Asn 
900 905 910 

gat ttc get ttt gac tgg cag get tac acg ccg ccg gtg gcg cac cgt 2784 
Asp Phe Ala Phe Asp Trp Gin Ala Tyr Thr Pro Pro Val Ala His Arg 
915 920 925 

etc ggc gtg cag gaa gtc gaa gee age ate gaa acg ctg cgt aat tac 2832 
Leu Gly Val Gin Glu Val Glu Ala Ser He Glu Thr Leu Arg Asn Tyr 
930 935 940 
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ate gac tgg aca ccg ttc ttt atg acc tgg teg ctg gec ggg aag tat 2660 
He Asp Trp Thr Pro Phe Phe Met Thr Trp Ser Leu Ala Gly. Lys Tyr 
945 950 955 960 

ccg cgc att ctg gaa gat gaa gtg gtg ggc gtt gag gcg cag egg ctg 2928 
Pro Arg He Leu Glu Asp Glu Val Val Gly Val Glu Ala Gin Arg Leu 
965 970 975 

ttt aaa gac gec aac gac atg ctg gat aaa tta age gec gag aaa acg 2976 
Phe Lye Asp Ala Asn Asp Met Leu Asp, Lys Leu Ser Ala Glu Lys Thr 
980 965 990 

ctg aat ccg cgt ggc gtg gtg ggc ctg ttc ccg gca aac cgt gtg ggc 3024 
Leu Asn Pro Arg Gly Val Val Gly Leu Phe Pro Ala Asn Arg Val Gly 
995 1000 1005 

gat gac att gaa ate tac cgt gac gaa acg cgt acc cat gtg ate aac 3072 
Asp Asp He Glu He Tyr Arg Asp Glu Thr Arg Thr His Val He Asn 
1010 1015 1020 

gtc age cac cat ctg cgt caa cag acc gaa aaa aca ggc ttc get aac 3120 
Val Ser His His Leu Arg Qln Gin Thr Glu Lys Thr Gly Phe Ala Asn 
1025 1030 1035 " 1040 

tac tgt etc get gac ttc gtt gcg ccg aag ctt tct ggt aaa gca gat 3168 
Tyr Cys Leu Ala Asp Phe Val Ala Pro Lys Leu Ser Gly Lys Ala Asp 
1045 1050 1055 

tac ate ggc gca ttt gec gtg act ggc ggg ctg gaa gag gac gca ctg 3216 
Tyr He Gly Ala Phe Ala Val Thr Gly Gly Leu Glu Glu Asp Ala Leu 
1060 1065 1070 

get gat gee ttt gaa gcg cag cac gat gat tac aac aaa ate atg gtg 3264 
Ala Asp Ala Phe Glu Ala Gin His Asp Asp Tyr Asn Lys He Met Val 
1075 1080 * * 1085 



aaa gcg ctt gee gac cgt tta gee gaa gec ttt gcg gag tat etc cat 3312 
Lys Ala Leu Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu Tyr Leu His 
1090 1095 1100 

gag cgt gtg cgt aaa gtc tac tgg ggc tat gcg ccg aac gag aac etc 3360 
Glu Arg Val Arg Lys Val Tyr Trp Gly Tyr Ala Pro Asn Glu Asn Leu 
H05 1110 1115 H20 

age aac gaa gag ctg ate cgc gaa aac tac cag ggc ate cgt ccg gca 3408 
Ser Asn Glu Glu Leu He Arg Glu Asn Tyr Gin Gly He Arg Pro Ala 
1125 1130 1135 

ccg ggc tat ccg gee tgc ccg gaa cat acg gaa aaa gec acc ate tgg 3456 
Pro Gly Tyr Pro Ala Cys Pro Glu His Thr Glu Lys Ala Thr He Trp 
1140 H45 1150 

gag ctg ctg gaa gtg gaa aaa cac act ggc atg aaa etc aca gaa tct 3504 
Glu Leu Leu Glu Val Glu Lys His Thr Gly Met Lys Leu Thr Glu Ser 
1155 1160 1165 

ttc gee atg tgg ccc ggt gca teg gtt teg ggt tgg tac ttc age cac 3552 
Phe Ala Met Trp Pro Gly Ala Ser Val Ser Gly Trp Tyr Phe Ser His 
1170 1175 1180 

ccg gac age aag tac tac get gta gca caa att cag cgc gat cag gtt 3600 
Pro Asp Ser Lys Tyr Tyr Ala Val Ala Gin He Gin Arg Asp Gin Val 
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H85 H90 U95 1200 

gaa gat tat gcc cgc cgt aaa ggt atg age gtt acc gaa gtt gag cgc 3648 

Glu Asp Tyr Ala Arg Arg Lys Gly Met Ser Val Thr Glu Val Glu Arg 
1205 1210 1215 

tgg ctg gca ccg aat ctg ggg tat gac gcg gac tga 3664 

Trp Leu Ala Pro Asn Leu Gly Tyr Asp Ala Asp 
1220 1225 



<210> 20 
<211> 1227 
<212> PRT 

<2 13 > Escherichia coli 
<400> 20 

Val Ser Ser Lys Val Glu Gin Leu Arg Ala Gin Leu Asn Glu Arg lie 
1 5 10 is 

Leu Val Leu Asp Gly Gly Met Gly Thr Met lie Gin Ser Tyr Arg Leu 
20 25 30 

Asn Glu Ala Asp Phe Arg Gly Glu Arg Phe Ala Asp Trp Pro Cys Asp 
35 40 45 

Leu Lys Gly Asn Asn Abp Leu Leu Val Leu Ser Lys Pro Glu Val He 
50 55 60 

Ala Ala He His Asn Ala Tyr Phe Glu Ala Gly Ala Asp He He Glu 
65 70 75 80 

Thr Asn Thr Phe Asn Ser Thr Thr He Ala Met Ala Asp Tyr Gin Met 
85 90 95 

Glu Ser Leu Ser Ala Glu lie Asn Phe Ala Ala Ala Lys Leu Ala Arg 
100 105 no 

Ala Cys Ala Asp Glu Trp Thr Ala Arg Thr Pro Glu Lys Pro Arg Tyr 
115 120 125 

Val Ala Gly Val Leu Gly Pro Thr Asn Arg Thr Ala Ser He Ser Pro 
130 135 140 

Asp Val Asn Asp Pro Ala Phe Arg Asn He Thr Phe Asp Gly Leu Val 
145 150 155 160 

Ala Ala Tyr Arg Glu Ser Thr Lys Ala Leu Val Glu Gly Gly Ala Asp 
165 170 175 

Leu He Leu He Glu Thr Val Phe Asp Thr Leu Asn Ala Lys Ala Ala 
180 185 190 

Val Phe Ala Val Lys Thr Glu Phe Glu Ala Leu Gly Val Glu Leu Pro 
195 200 205 

He Met He Ser Gly Thr He Thr Asp Ala Ser Gly Arg Thr Leu Ser 
210 215 220 

Gly Gin Thr Thr Glu Ala Phe Tyr Asn Ser Leu Arg His Ala Glu Ala 
225 230 235 240 
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Leu Thr Phe Gly Leu Asn Cys Ala Leu Gly Pro Asp Glu Leu Arg Gin 
245 250 255 

Tyr Val Gin Glu Leu Ser Arg He Ala Glu Cys Tyr Val Thr Ala His 
260 265 270 " . 

Pro Asn Ala Gly Leu Pro Asn Ala Phe Gly Glu Tyr Asp Leu Asp Ala 
275 280 285 

Asp Thr Met Ala Lys Gin He Arg Glu Trp Ala Gin Ala Gly Phe Leu 
290 295 300 

Asn He Val Gly Gly Cys Cys Gly Thr Thr Pro Gin His lie Ala Ala 
305 310 315 320 

Met Ser Arg Ala Val Glu Gly Leu Ala Pro Arg Lys Leu Pro Glu He 
325 330 335 

Pro Val Ala Cys Arg Leu Ser Gly Leu Glu Pro Leu Asn He Gly Glu 
340 345 350 

Asp Ser Leu Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser 
355 360 365 

Ala Lys Phe Lys Arg Leu He Lys Glu Glu Lys Tyr Ser Glu Ala Leu 
370 375 380 

Asp Val Ala Arg Gin Gin Val Glu Asn Gly Ala Gin He He Asp He 
385 390 395 400 

Asn Met Asp Glu Gly Met Leu Aep Ala Glu Ala Ala Met Val Arg Phe 
405 410 415 

Leu Asn Leu He Ala Gly Glu Pro Asp He Ala Arg Val Pro He Met 
420 425 430 

He Asp Ser Ser Lys Trp Asp Val He Glu Lys Gly Leu Lys Cys He 
435 440 445 

Gin Gly Lys Gly He Val Asn Ser He Ser Met Lys Glu Gly Val Asp 
450 455 460 

Ala Phe He His His Ala Lys Leu Leu Arg Arg Tyr Gly Ala Ala Val 
465 470 475 * 480 

Val Val Met Ala Phe Asp Glu Gin Gly Gin Ala Asp Thr Arg Ala Arg 
485 490 495 

Lys He Glu He Cys Arg Arg Ala Tyr Lys He Leu Thr Glu Glu Val 
500 505 510 

Gly Phe Pro Pro Glu Asp He He Phe Asp Pro Asn He Phe Ala Val 
515 520 525 

Ala Thr Gly He Glu Glu His Asn Asn Tyr Ala Gin Asp Phe He Gly 
530 535 540 

Ala Cys Glu Asp He Lys Arg Glu Leu Pro His Ala Leu He Ser Gly 
545 550 555 560 

Gly Val Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Asp Pro Val Arg 
565 570 575 
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Glu Ala lie His Ala Val Phe Leu Tyr Tyr Ala lie Arg Asn Qly Met 
580 585 590 

Asp Met Gly lie Val Asn Ala Gly Gin Leu Ala He Tyr Asp Asp Leu 
595 600 605 

Pro Ala Glu Leu Arg Asp Ala Val Glu Asp Val He Leu Asn Arg Arg 
610 615 620 

Asp Asp Gly Thr Glu Arg Leu Leu Glu Leu Ala Glu Lys Tyr Arg Gly 
625 630 635 640 

Ser Lys Thr Asp Asp Thr Ala Asn Ala Gin Gin Ala Glu Trp Arg Ser 
645 650 655 

Trp Glu Val Asn Lys Arg Leu Glu Tyr Ser Leu Val Lys Gly He Thr 
660 665 670 

Glu Phe He Glu Gin Asp Thr Glu Glu Ala Arg Gin Gin Ala Thr Arg 
675 680 685 

Pro He Glu Val He Glu Gly Pro Leu Met Asp Gly Met Asn Val Val 
690 695 700 

Gly Asp Leu Phe Gly Glu Gly Lys Met Phe Leu Pro Gin Val Val Lys 
705 710 715 720 

Ser Ala Arg Val Met Lys Gin Ala Val Ala Tyr Leu Glu Pro Phe He 
725 730 735 

Glu Ala Ser Lys Glu Gin Gly Lys Thr Asn Gly Lys Met Val He Ala 
740 745 750 

Thr Val Lys Gly Asp Val His Asp He Gly Lys Asn He Val Gly Val 
755 760 765 

Val Leu Gin Cys Asn Asn Tyr Glu He Val Asp Leu Gly Val Met Val 
770 775 780 

Pro Ala Glu Lys He Leu Arg Thr Ala Lys Glu Val Asn Ala Abp Leu 
785 790 795 B00 

He Gly Leu Ser Gly Leu He Thr Pro Ser Leu Asp Glu Met Val Asn 
805 810 815 

Val Ala Lys Glu Met Glu Arg Gin Gly Phe Thr He Pro Leu Leu He 
820 825 830 

Gly Gly Ala Thr Thr Ser Lys Ala His Thr Ala Val Lys He Glu Gin 
835 840 845 

Asn Tyr Ser Gly Pro Thr Val Tyr Val Gin Asn Ala Ser Arg Thr Val 
850 855 860 

Gly Val Val Ala Ala Leu Leu Ser Asp Thr Gin Arg Asp Asp Phe Val 
865 870 875 880 

Ala Arg Thr Arg Lys Glu Tyr Glu Thr Val Arg He Gin His Gly Arg 
885 890 895 

Lys Lys Pro Arg Thr Pro Pro Val Thr Leu Glu Ala Ala Arg Asp Asn 
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900 



905 



910 



Asp Phe Ala Phe Asp Trp Gin Ala Tyr Thr Pro Pro Val Ala Hie Arg 
515 920 925 

Leu Gly Val Gin Glu Val Glu Ala Ser He Glu Thr Leu Arg Asn Tyr 
930 935 940 

He Asp Trp Thr Pro Phe Phe Met Thr Trp Ser Leu Ala Gly Lys Tyr 
945 950 955 960 

Pro Arg He Leu Glu Asp Glu Val Val Gly Val Glu Ala Gin Arg Leu 
965 970 975 

Phe Lys Asp Ala Asn Asp Met Leu Asp Lys Leu Ser Ala Glu Lys Thr 
980 985 990 

Leu Asn Pro Arg Gly Val Val Gly Leu Phe Pro Ala Asn Arg Val Gly 
995 1000 1005 

Asp Asp He Glu He Tyr Arg Asp Glu Thr Arg Thr His Val He Asn 
1010 1015 1020 

Val Ser His His Leu Arg Gin Gin Thr Glu Lys Thr Gly Phe Ala Asn 
1025 1030 1035 1040 

Tyr Cys Leu Ala Asp Phe Val Ala Pro Lys Leu Ser Gly Lys Ala Asp 
1045 1050 1055 

Tyr He Gly Ala Phe Ala Val Thr Gly Gly Leu Glu Glu Asp Ala Leu 
1060 1065 1070 

Ala Asp Ala Phe Glu Ala Gin His Asp Asp Tyr Asn Lys He Met Val 
1075 1080 1085 

Lys Ala Leu Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu Tyr Leu His 
1090 1095 1100 

Glu Arg Val Arg Lys Val Tyr Trp Gly Tyr Ala Pro Asn Glu Asn Leu 
1105 1110 1115 1120 

Ser Asn Glu Glu Leu He Arg Glu Asn Tyr Gin Gly He Arg Pro Ala 
1125 1130 * H35 

Pro Gly Tyr Pro Ala Cys Pro Glu His Thr Glu Lys Ala Thr He Trp 
1140 1145 1150 

Glu Leu Leu Glu Val Glu Lys His Thr Gly Met Lys Leu Thr Glu Ser 
1155 1160 1165 

Phe Ala Met Trp Pro Gly Ala Ser Val Ser Gly Trp Tyr Phe Ser His 
1170 1175 1180 

Pro Asp Ser Lys Tyr Tyr Ala Val Ala Gin He Gin Arg Asp Gin Val 
H85 1190 1195 1200 

Glu Asp Tyr Ala Arg Arg Lys Gly Met Ser Val Thr Glu Val Glu Arg 
1205 1210 1215 

Trp Leu Ala Pro Asn Leu Gly Tyr Asp Ala Asp 



1220 



1225 



( 
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<210> 21 
<211> 3771 
<212> DNA 

<213> Salmonella typhimurium 

<220> 
<221> CDS 
<222> (1)..(3768) 
<223> RSY02996 

<400> 21 

atg tct cat gtt gcc cgt tgt tct ctt ttc cgc cag cac get ttg tgc 48 

Met Ser His Val Ala Arg Cys Ser Leu Phe Arg Gin His Ala Leu Cys 

1 5 10 15 

cag tat ggc teg tta cgt gga gcg ttg teg gga gcg agt gtg age age 96 
Gin Tyr Gly Ser Leu Arg Gly Ala Leu Ser Gly Ala Ser Val Ser Ser 
20 25 30 

aaa gtt gaa caa ctg cgt gcg cag tta aat gaa cgt att ctg gtg ctg 144 
Lys Val Glu Gin Leu Arg Ala Gin Leu Asn Glu Arg lie Leu Val Leu 
35 40 ^ 45 

gac ggc ggt atg ggc act atg ate cag age tat cgt eta cat gaa gaa 192 
Asp Gly Gly Met Gly Thr Met He Gin Ser Tyr Arg Leu His Glu Glu 
50 55 60 

gat ttc cgc ggg gag cgc ttt gcc gac tgg ccc tgc gac ctg aaa ggc 240 
Asp Phe Arg Gly Glu Arg Phe Ala Asp Trp Pro Cys Asp Leu Lys Gly 
65 70 75 80 

aac aat gac ctg ctg gtc etc age aag ccg gag gtg ate gcc get ate 288 
Asn Asn Asp Leu Leu Val Leu Ser Lys Pro Glu Val He Ala Ala He 
85 90 95 

cac aac gcc tac ttt gag get ggc gcg gat ate ate gaa ace aac ace 336 
His Asn Ala Tyr Phe Glu Ala Gly Ala Asp He He Glu Thr Asn Thr 
100 105 110 

ttt aac teg aca ace att gcg atg gcg gat tac egg atg gaa tec ctg 384 
Phe Asn Ser Thr Thr He Ala Met Ala Asp Tyr Arg Met Glu Ser Leu 
115 120 125 

teg gcg gaa att aac tat gcg gcg gcc aaa ctg gcg cgc gcc tgc gcc 432 
Ser Ala Glu He Asn Tyr Ala Ala Ala Lye Leu Ala Arg Ala Cys Ala 
130 135 140 

gat gaa tgg acg gcg cga aca cca gaa aaa cca cgc ttt gtt gcg ggc 480 
Asp Glu Trp Thr Ala Arg Thr Pro Glu Lys Pro Arg Phe val Ala Gly 
145 150 155 160 

gtg ctt ggt cca act aac cgc acg gcc tec att teg ccg gac gtc aac 528 
Val Leu Gly Pro Thr Asn Arg Thr Ala Ser He Ser Pro Asp Val Asn 
165 170 175 

gac ccg gcg ttt cgt aat ate acc ttc gat cag ctg gtg gcg gcc tac 576 
Asp Pro Ala Phe Arg Asn lie Thr Phe Asp Gin Leu Val Ala Ala Tyr 
180 185 190 

cgt gaa tec acc aaa gcg ctg gtg gaa ggt ggc gca gat ctg att ctg 624 
Arg Glu Ser Thr Lys Ala Leu Val Glu Gly Gly Ala Asp Leu He Leu 
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195 200 205 

att gag acc gtt ttt gac acc ctg aat gcg aaa gcg gcg gtg ttt gcg 672 
He Glu Thr Val Phe Asp Thr Leu Asn Ala Lye Ala Ala Val Phe Ala 
210 215 220 

gtg aaa gaa gag ttt gaa gcg ctg ggc gtt gac ctg ccg ate atg att 720 
Val Lys Glu Glu Phe Glu Ala Leu Gly Val Asp Leu Pro He Met He 
225 230 235 240 

tec ggc acc ate acc gac gee tct ggc cgt acg ctt tec ggc cag act 768 
Ser Gly Thr lie Thr Asp Ala Ser Gly Arg Thr Leu Ser Gly Gin Thr 
245 250 255 

acc gaa gee ttt tat aac teg ctg cgc cac gee gag gcg etc act ttt 816 
Thr Glu Ala Phe Tyr Asn Ser Leu Arg His Ala Glu Ala Leu Thr Phe 
260 265 270 

ggc ctt aac tgc gca ctg ggg cca gat gaa ctg cgc cag tac gtc cag 864 
Gly Leu Asn Cys Ala Leu Gly Pro Asp Glu Leu Arg Gin Tyr Val Gin 
275 280 285 

gaa ctg teg egg att gee gaa tgc tac gtc acc gcg cac ccg aac gee 912 
Glu Leu Ser Arg He Ala Glu Cys Tyr Val Thr Ala His Pro Asn Ala 
290 295 300 

ggc ctg ccg aac get ttc ggc gag tat gac etc gac gee gac acc atg 960 
Gly Leu Pro Asn Ala Phe Gly Glu Tyr Asp Leu Asp Ala Asp Thr Met 
305 310 315 320 

gcg aaa cag att cgc gaa tgg gcg gaa gcg ggc ttc ctg aat ate gtt 1008 
Ala Lys Gin He Arg Glu Trp Ala Glu Ala Gly Phe Leu Asn He Val 
325 330 * 335 

ggc ggc tgc tgc ggc acc acg ccg gag cat att gcg gcg atg age cgc 1056 
Gly Gly Cys Cys Gly Thr Thr Pro Glu His He Ala Ala Met Ser Arg 
340 345 350 

gee gtt gee ggt ttg ctg ccg cgc cag ctg ccg gat ate ccg gtc gee 1104 
Ala Val Ala Gly Leu Leu Pro Arg Gin Leu Pro Asp He Pro Val Ala > 
355 360 365 

tgc cgc ctt tec ggc ctg gag ccg ctg aac att ggc gac gat age ctg 1152 
Cys Arg Leu Ser Gly Leu Glu Pro Leu Asn He Gly Asp Asp Ser Leu 
370 375 380 



ttt gtg aac gtc ggc gaa cgt act aac gtc acc ggc teg gec aaa ttt 
Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser Ala Lys Phe 
385 390 395 400 



1200 



aaa egg ctg ate aaa gaa gag aaa tac age gaa gcg ctg gat gtc gee 1248 
Lys Arg Leu He Lys Glu Glu Lye Tyr Ser Glu Ala Leu Asp Val Ala 
405 410 415 

cgt cag cag gta gaa age ggc gcg cag att att gat ate aat atg gat 1296 
Arg Gin Gin Val Glu Ser Gly Ala Gin He He Asp He Asn Met Asp 
420 42S 430 

gag ggg atg etc gac gee gaa gcg gcg atg gtg cgt ttc etc age ctg 1344 
Glu Gly Met Leu Asp Ala Glu Ala Ala Met Val Arg Phe Leu Ser Leu 
435 440 445 
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att gcc ggt gag ccg gac att gcc cgt gta cca ate atg ate gac tec 1392 
He Ala Gly Glu Pro Asp He Ala Arg Val Pro He Met lie Asp Ser 
450 455 460 

tec aaa tgg gag gtt ate gaa aaa ggg ctg aag tgc att cag ggt aaa 1440 
Ser Lye Trp Glu Val He Glu Lys Gly Leu LyB Cys He Gin Gly Lys 
465 470 475 4B0 

ggc ate gtc aac tct att teg atg aaa gag ggc gtg gaa gcc ttt att 1488 
Gly He Val Asn Ser He Ser Met Lys . Glu Gly Val Glu Ala Phe He 
485 490 495 

cat cat gcg aag ctt ctg cgt cgc tac ggc gcg gca gtg gtg gtg atg 1536 
His His Ala Lys Leu Leu Arg Arg Tyr Gly Ala Ala Val Val Val Met 
500 505 5X0 

gcc ttt gat gag cag ggg cag gcc gat ace cgc gcg cgt aaa ate gaa 1584 
Ala Phe Asp Glu Gin Gly Gin Ala Asp Thr Arg Ala Arg Lye He Glu 
515 520 525 

att tgc cgc cga gcc tac aaa att etc ace gaa gag gtg ggt ttc ccg 1632 
He Cys Arg Arg Ala Tyr Lys He Leu Thr Glu Glu Val Gly Phe Pro 
530 535 540 

ccg gaa gac ate ate ttc gac ccg aat ate ttc gcc gtg gcg ace ggt 1680 
Pro Glu Asp He He Phe Asp Pro Asn He Phe Ala Val Ala Thr Gly 
545 550 555 560 

att gaa gag cac aac aac tac gcg cag gac ttt ate ggc get tgc gaa 1728 
He Glu Glu His Asn Asn Tyr Ala Gin Asp Phe He Gly Ala Cys Glu 
565 570 575 

gac ate aaa cgc gag ctg ccg cac gcg ctg ate tec ggc ggc gtg tct 1776 
Asp He Lys Arg Glu Leu Pro His Ala Leu He Ser Gly Gly Val Ser 
580 585 590 

aac gtg tec ttc teg ttc cgc ggc aac gac ccg gta cgt gag get ate 1824 
Asn Val Ser Phe Ser Phe Arg Gly Asn Asp Pro Val Arg Glu Ala He 
595 600 605 

cac gcg gta ttc etc tac tac gcc ate cgc aac ggt atg gac atg ggc 1872 
His Ala Val Phe Leu Tyr Tyr Ala He Arg Asn Gly Met Asp Met Gly 
610 615 620 

ate gtc aac gcc gga cag ctg get ate tac gac gac ctg ccc gcc gag 1920 
He Val Asn Ala Gly Gin Leu Ala He Tyr Asp Asp Leu Pro Ala Glu 
625 630 635 640 

ctg cgc gat gcg gtt gaa gat gtc att ctt aac cgt cgc gat gac ggc 1968 
Leu Arg Asp Ala Val Glu Asp Val He Leu Asn Arg Arg Asp Asp Gly 
645 650 655 

act gag cgt ttg ctg gat ttg gcg gag aaa tac cgc ggc age aaa acc 2016 
Thr Glu Arg Leu Leu Asp Leu Ala Glu Lys Tyr Arg Gly Ser Lys Thr 
660 665 670 

gac gaa get gcc aac gcc cag cag gcg gaa tgg cgt age tgg gac gtg 2064 
Asp Glu Ala Ala ABn Ala Gin Gin Ala Glu Trp Arg Ser Trp Asp Val 
675 680 685 

aaa aag cgt etc gaa tac teg ctg gtg aaa ggc att acc gaa ttt ate 2112 
Lys Lys Arg Leu Glu Tyr Ser Leu Val Lys Gly He Thr Glu Phe He 
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690 695 700 

gaa cag gat acc gaa gaa gcc cgt cag cag gcc gcc cgc ccg att gag 2160 
Glu Gin Asp Thr Glu Glu Ala Arg Gin Gin Ala Ala Arg Pro He Glu 
705 710 715 720 

gtg att gaa ggg ccg ttg atg gac ggc atg aac gtg gtc ggc gac ctg 2208 
Val He Glu Gly Pro Leu Met Asp Gly Met Asn Val Val Gly Asp Leu 
725 730 735 

ttc ggc gaa ggg aaa atg ttc ctg ccg cag gtg gtg aaa tec get cgc 2256 
Phe Gly Glu Gly Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala Arg 
740 745 750 

gtg atg aaa caa gcg gtg gcc tac ctg gag ccg ttt att gaa gcc age 2304 
Val Met Lys Gin Ala Val Ala Tyr Leu Glu Pro Phe He Glu Ala Ser 
755 760 765 

aaa gaa aaa ggc tec age aac ggc aag atg gtg ate gcc acc gtg aag 2352 
Lys Glu Lys Gly Ser Ser Asn Gly Lys Met Val He Ala Thr Val Lys 
770 775 780 

ggc gat gtg cac gat att ggt aaa aat ate gtt ggc gtg gtg ctg caa 2400 
Gly Asp Val His Asp He Gly Lys Asn lie Val Gly Val Val Leu Gin 
785 790 795 800 

tgt aac aac tac gaa ate gtc gat ctt ggc gtg atg gtg cca gcg gag 2448 
Cys Asn Asn Tyr Glu He Val Asp Leu Gly Val Met Val Pro Ala Glu 
805 810 815 

aaa ate etc aga acg gcg cgt gaa gtg aat gcc gat ctg ate ggt ctt 2496 
Lys He Leu Arg Thr Ala Arg Glu Val Asn Ala Asp Leu He Gly Leu 
820 825 830 

tec ggg ctg att acc ccg teg ctg gac gaa atg gtc aac gtg gcg aaa 2544 
Ser Gly Leu He Thr Pro Ser Leu Asp Glu Met Val Asn Val Ala Lys 
835 840 645 

gag atg gag cgt cag ggc ttt act ate ccg eta ctg ate ggc ggc gca 2592 
Glu Met Glu Arg Gin Gly Phe Thr He Pro Leu Leu He Gly Gly Ala 
850 855 860 

acc act teg aaa gcg cat acg gcg gtg aaa ate gag cag aac tac age 2640 
Thr Thr Ser Lys Ala His Thr Ala Val Lys He Glu Gin Asn Tyr Ser 
865 870 875 880 

ggt ccg acg gtc tac gtg cag aat get teg cgt acc gtg ggc gtg gtg 2688 
Gly Pro Thr Val Tyr Val Gin Asn Ala Ser Arg Thr Val Gly Val Val 
885 890 895 

gcg gcg eta etc tec gac acc cag cgt gat gac ttt gtc gcc cgt acc 2736 
Ala Ala Leu Leu Ser Asp Thr Gin Arg Asp Asp Phe Val Ala Arg Thr 
900 905 910 

cgc aaa gag tac gaa acc gtg cgt att cag cac gcc cgc aaa aaa ccg 2784 
Arg Lys Glu Tyr Glu Thr Val Arg He Gin His Ala Arg Lys Lys Pro 
915 920 925 

cgc acg ccg ccg gtc acg ctg gag gcg gcg cgc gat aac gat ctg gca 2832 
Arg Thr Pro Pro Val Thr Leu Glu Ala Ala Arg Asp Asn Asp Leu Ala 
930 935 940 
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ttc gat tgg gaa cgc tac acc ccg ccg gta gcc cac cgt ctg ggc gtg 2880 

Phe Asp Trp Glu Arg Tyr Thr Pro Pro Val Ala His Arg Leu Gly Val 

945 950 955 960 

, * 

cag gag gtg gaa gcc age ate gaa act ctg cgc aac tat ate gac tgg 2928 

Gin Glu Val Glu Ala Ser He Glu Thr Leu Arg Asn Tyr He Asp Trp 
965 970 975 

acg ccg ttc ttt atg acc tgg teg ctg gcc ggc aaa tac ccg cgc att 2976 
Thr Pro Phe Phe Met Thr Trp Ser Leu Ala Gly Lys Tyr Pro Arg He 
980 985 990 

ctg gaa gat gag gtg gtg ggc gtt gag geg cag cgt ctg ttt aaa gac 3024 
Leu Glu Asp Glu Val Val Gly Val Glu Ala Gin Arg Leu Phe Lys Asp 
995 1000 1005 

gcc aat gat atg ctg gat aaa ctg age gcc gag aaa ctg ttg aat ccg 3072 
Ala Asn Asp Met Leu Asp Lys Leu Ser Ala Glu Lys Leu Leu Asn Pro 
1010 1015 1020 

cgt ggc gtg gtg ggc ctg ttc ccg gcg aac cgt gtg ggt gac gac ate 3120 
Arg Gly Val Val Gly Leu Phe Pro Ala Asn Arg Val Gly Asp Asp He 
1025 1030 1035 1040 

gaa ate tat cgc gac gaa acc cgt act cat gtt ctg acg gtc age cac 3168 
Glu He Tyr Arg Asp Glu Thr Arg Thr Hie Val Leu Thr Val Ser His 
1045 1050 1055 

cac ctg cgc cag cag acc gag aaa gtc ggt ttt gcc aac tac tgt ctg 3216 
His Leu Arg Gin Gin Thr Glu Lys Val Gly Phe Ala Asn Tyr Cys Leu 
1060 1065 1070 

gcg gat ttt gtc gcg ccg aaa ctg age ggc aaa gcg gat tac ate ggt 3264 
Ala Asp Phe Val Ala Pro Lys Leu Ser Gly Lye Ala Asp Tyr lie Gly 
1075 1080 1085 



get ttc gcg gtg acc ggc ggt ctg gag gag gat gcg ctg gcg gac gcc 3312 
Ala Phe Ala Val Thr Gly Gly Leu Glu Glu Asp Ala Leu Ala Asp Ala 
1090 1095 1100 

ttc gaa gcg caa cac gac gac tat aac aag ate atg gtg aaa gcg att 3360 
Phe Glu Ala Gin His Asp Asp Tyr Asn Lys He Met Val Lys Ala He 
1105 1110 1115 1120 



gcc gac cgt ctg gcg gaa gcg ttc gcg gaa tat ctg cat gag cgt gta 3408 
Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu Tyr Leu His Glu Arg Val 
1125 1130 1135 

cgt aag gtt tac tgg gga tat gcg ccg aac gag age ctg agt aac gac 3456 
Arg Lys Val Tyr Trp Gly Tyr Ala Pro Asn Glu Ser Leu Ser Asn Asp 
1140 1145 1150 



gaa tta ate cgc gaa aac tac cag ggg att cgc ccg gcg ccg ggt tat 3504 
Glu Leu He Arg Glu Asn Tyr Gin Gly He Arg Pro Ala Pro Gly Tyr 
1155 1160 1165 

cct gcc tgc ccg gaa cat acc gaa aaa ggc act ate tgg cag eta ctg 3552 
Pro Ala CyB Pro Glu His Thr Glu Lys Gly Thr He Trp Gin Leu Leu 
1170 1175 1180 

gat gtc gaa aaa cac acc ggg atg aag etc acc gaa tct ttc gcc atg 3600 
Asp Val Glu Lys His Thr Gly Met Lys Leu Thr Glu Ser Phe Ala Met 
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1185 1190 1195 1200 

tgg cca ggc gcg teg gtc tec ggc tgg tac ttc age cat cct gag age 3648 
Trp Pro Gly Ala Ser Val Ser Gly Trp Tyr Phe Ser His Pro Glu Ser 
1205 1210 1215 

aaa tac ttc gcg gta gcg cag ate caa cgc gat cag gtg aca gat tat 3696 
Lys Tyr Phe Ala Val Ala Gin He Gin Arg Asp Gin Val Thr Asp Tyr 
1220 1225 * 1230 

get ttc cgt aaa gga atg age gtt gag gat gtt gag egg tgg etc gcg 3744 
Ala Phe Arg Lys Gly Met Ser Val Glu Asp Val Glu Arg Trp Leu Ala 
1235 1240 1245 

ccg aac ctg ggt tac gat gcg gac tga 3771 
Pro Asn Leu Gly Tyr Asp Ala Asp 
1250 1255 



<210> 22 
<211> 1256 
<212> PRT 

<213> Salmonella typhimurium 
<400> 22 

Met Ser His Val Ala Arg Cys Ser Leu Phe Arg Gin His Ala Leu Cys 
1 5 10 15 

Gin Tyr Gly Ser Leu Arg Gly Ala Leu Ser Gly Ala Ser Val Ser Ser 
20 25 30 

Lys Val Glu Gin Leu Arg Ala Gin Leu Asn Glu Arg He Leu Val Leu 
35 40 45 

Asp Gly Gly Met Gly Thr Met He Gin Ser Tyr Arg Leu His Glu Glu 
50 55 " 60 

Asp Phe Arg Gly Glu Arg Phe Ala Asp Trp Pro Cys Asp Leu Lys Gly 
65 70 75 80 

Asn Asn Abp Leu Leu Val Leu Ser Lys Pro Glu Val He Ala Ala He 
85 90 95 

His Asn Ala Tyr Phe Glu Ala Gly Ala Asp He He Glu Thr Asn Thr 
100 105 110 

Phe Asn Ser Thr Thr He Ala Met Ala Asp Tyr Arg Met Glu Ser Leu 
115 120 125 

Ser Ala Glu He Asn Tyr Ala Ala Ala Lys Leu Ala Arg Ala Cys Ala 
130 135 140 

Asp Glu Trp Thr Ala Arg Thr Pro Glu Lys Pro Arg Phe Val Ala Gly 
145 150 155 160 

Val Leu Gly Pro Thr Asn Arg Thr Ala Ser He Ser Pro Asp Val Asn 
165 170 175 

Asp Pro Ala Phe Arg Asn He Thr Phe Asp Gin Leu Val Ala Ala Tyr 
180 185 190 

Arg Glu Ser Thr Lys Ala Leu Val Glu Gly Gly Ala Asp Leu He Leu 
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195 200 205 

He Glu Thr Val Phe Asp Thr Leu Asn Ala Lys Ala Ala Val Phe Ala 
210 215 220 

Val Lye Glu Glu Phe Glu Ala Leu Gly Val Asp Leu Pro He Met He 
225 230 235 240 

Ser Gly Thr He Thr Asp Ala Ser Gly Arg Thr Leu Ser Gly Gin Thr 
245 250 255 

Thr Glu Ala Phe Tyr Asn Ser Leu Arg His Ala Glu Ala Leu Thr Phe 
260 265 270 

Gly Leu Asn Cys Ala Leu Gly Pro Asp Glu Leu Arg Gin Tyr Val Gin 
275 280 285 

Glu Leu Ser Arg He Ala Glu Cys Tyr Val Thr Ala His Pro Asn Ala 
290 295 300 

Gly Leu Pro Asn Ala Phe Gly Glu Tyr Asp Leu Asp Ala Asp Thr Met 
305 310 315 320 

Ala Lys Gin He Arg Glu Trp Ala Glu Ala Gly Phe Leu Asn He Val 
325 330 335 

Gly Gly Cys Cys Gly Thr Thr Pro Glu His He Ala Ala Met Ser Arg 
340 345 350 

Ala Val Ala Gly Leu Leu Pro Arg Gin Leu Pro Asp He Pro Val Ala 
355 360 365 

Cys Arg Leu Ser Gly Leu Glu Pro Leu Asn He Gly Asp Asp Ser Leu 
370 375 380 

Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser Ala Lys Phe 
385 390 395 400 

Lys Arg Leu He Lys Glu Glu Lys Tyr Ser Glu Ala Leu Asp Val Ala 
405 410 415 

Arg Gin Gin Val Glu Ser Gly Ala Gin He He Asp He Asn Met Asp 
420 425 430 

Glu Gly Met Leu Asp Ala Glu Ala Ala Met Val Arg Phe Leu Ser Leu 
435 440 445 

He Ala Gly Glu Pro Asp He Ala Arg Val Pro He Met He Asp Ser 
450 455 460 

Ser Lys Trp Glu Val He Glu Lys Gly Leu Lys Cys He Gin Gly Lys 
465 470 475 480 

Gly He Val Asn Ser He Ser Met Lys Glu Gly Val Glu Ala Phe He 
485 490 495 

His His Ala Lys Leu Leu Arg Arg Tyr Gly Ala Ala Val Val Val Met 
500 505 510 

Ala Phe Asp Glu Gin Gly Gin Ala Asp Thr Arg Ala Arg Lys He Glu 
515 520 525 
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He Cys Arg Arg Ala Tyr Lys He Leu Thr Glu Glu Val Gly Phe Pro 
530 535 540 

Pro Glu Asp He He Phe Asp Pro Asn He Phe Ala Val Ala Thr Gly 
54 * 550 555 560 

He Glu Glu His Asn Abh Tyr Ala Gin Asp Phe He Gly Ala CyB Glu 
565 570 575 

Asp He Lys Arg Glu Leu Pro His Ala Leu He Ser Gly Gly Val Ser 
580 585 590 

Asn Val Ser Phe Ser Phe Arg Gly Asn Asp Pro Val Arg Glu Ala He 
595 600 605 

His Ala Val Phe Leu Tyr Tyr Ala He Arg Asn Gly Met Asp Met Gly 
610 615 620 

He Val Asn Ala Gly Gin Leu Ala He Tyr ABp Asp Leu Pro Ala Glu 
6 25 630 635 640 

Leu Arg Asp Ala Val Glu Asp Val He Leu Asn Arg Arg Asp Asp Gly 
645 650 655 

Thr Glu Arg Leu Leu Asp Leu Ala Glu Lys Tyr Arg Gly Ser Lys Thr 
660 665 670 

Asp Glu Ala Ala Asn Ala Gin Gin Ala Glu Trp Arg Ser Trp Asp Val 
675 680 685 

Lys Lys Arg Leu Glu Tyr Ser Leu Val Lys Gly He Thr Glu Phe He 
690 695 700 

Glu Gin Asp Thr Glu Glu Ala Arg Gin Gin Ala Ala Arg Pro He Glu 
7 05 710 715 ^ 720 

Val lie Glu Gly Pro Leu Met Asp Gly Met Asn Val Val Gly Asp Leu 
725 730 735 

Phe Gly Glu Gly Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala Arg 
740 745 750 

Val Met Lys Gin Ala Val Ala Tyr Leu Glu Pro Phe He Glu Ala Ser 
755 760 765 

Lys Glu Lys Gly Ser Ser Asn Gly Lys Met Val He Ala Thr Val Lys 
770 775 780 

Gly Asp Val His Asp He Gly Lys Asn lie Val Gly Val Val Leu Gin 
785 790 795 800 

Cys Asn Asn Tyr Glu lie Val Asp Leu Gly Val Met Val Pro Ala Glu 
805 810 815 

Lys lie Leu Arg Thr Ala Arg Glu Val Asn Ala Asp Leu lie Gly Leu 
820 825 830 

Ser Gly Leu lie Thr Pro Ser Leu Asp Glu Met Val Asn Val Ala Lys 
835 840 845 

Glu Met Glu Arg Gin Gly Phe Thr lie Pro Leu Leu He Gly Gly Ala 
850 855 860 
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Thr Thr Ser Lys Ala Hie Thr Ala Val Lye He Olu Gin Asn. Tyr Ser 
865 870 875 880 

Gly' Pro Thr Val Tyr Val Gin Asn Ala Ser Arg Thr Val Gly Val Val 
885 890 895 

Ala Ala Leu Leu Ser Asp Thr Gin Arg Asp Asp Phe Val Ala Arg Thr 
900 905 910 

Arg Lys Glu Tyr Glu Thr Val Arg He Gin His Ala Arg Lys Lys Pro 
915 920 925 

Arg Thr Pro Pro Val Thr Leu Glu Ala Ala Arg Asp Asn Asp Leu Ala 
930 935 940 

Phe Asp Trp Glu Arg Tyr Thr Pro Pro Val Ala His Arg Leu Gly Val 
945 950 955 960 

Gin Glu Val Glu Ala Ser He Glu Thr Leu Arg Asn Tyr He Asp Trp 
965 970 975 

Thr Pro Phe Phe Met Thr Trp Ser Leu Ala Gly Lys Tyr Pro Arg He 
980 985 990 

Leu Glu Asp Glu Val Val Gly Val Glu Ala Gin Arg Leu Phe Lys Asp 
995 1000 1005 

Ala ABn Asp Net Leu Asp Lys Leu Ser Ala Glu Lys Leu Leu Asn Pro 
1010 1015 1020 

Arg Gly Val Val Gly Leu Phe Pro Ala Asn Arg Val Gly Asp Asp He 
1025 1030 1035 1040 

Glu He Tyr Arg Asp Glu Thr Arg Thr His Val Leu Thr Val Ser His 
1045 1050 1055 

His Leu Arg Gin Gin Thr Glu Lys Val Gly Phe Ala Asn Tyr Cys Leu 
1060 1065 1070 

Ala Asp Phe Val Ala Pro Lys Leu Ser Gly Lys Ala Asp Tyr He Gly 
1075 1080 1085 

Ala Phe Ala Val Thr Gly Gly Leu Glu Glu Asp Ala Leu Ala Asp Ala 
1090 1095 1100 

Phe Glu Ala Gin His Asp Asp Tyr Asn Lys He Met Val Lys Ala He 
1105 1110 1115 1120 

Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu Tyr Leu His Glu Arg Val 
1125 1130 1135 

Arg Lys Val Tyr Trp Gly Tyr Ala Pro Asn Glu Ser Leu Ser Asn Asp 
1140 1145 1150 

Glu Leu He Arg Glu Asn Tyr Gin Gly He Arg Pro Ala Pro Gly Tyr 
1155 1160 " ~ 1165 

Pro Ala Cys Pro Glu His Thr Glu Lys Gly Thr He Trp Gin Leu Leu 
1170 1175 1180 

Asp Val Glu Lys His Thr Gly Met Lys Leu Thr Glu Ser Phe Ala Met 



WO 03/087386 PCT/EP03/04010 

97 

1185 1190 H95 1200 

Trp Pro Gly Ala Ser Val Ser Gly Trp Tyr Phe Ser His Pro Glu Ser 
1205 1210 1215 

Lye Tyr Phe Ala Val Ala Gin He Gin Arg Asp Gin Val Thr Asp Tyr 
1220 1225 1230 

Ala Phe Arg Lys Gly Met Ser Val Glu Asp Val Glu Arg Trp Leu Ala 
1235 1240 1245 

Pro Asn Leu Gly Tyr Asp Ala Asp 
1250 1255 



<210> 23 
<211> 3771 
<212> DNA 

<213> Salmonella typhi 

<220> 
<221> CDS 
<222> (1)..(3768) 
<223> RTY036B6 

<400> 23 

atg tct cat gtt gcc cgt tgt tct ctt ttc cgc cag cac get ttg tgc 48 

Met Ser His Val Ala Arg Cys Ser Leu Phe Arg Gin His Ala Leu Cys 
1 5 io is 

cag tat ggc teg tta cgt gga gcg ttg teg gga gcg agt gtg age age 96 
Gin Tyr Gly Ser Leu Arg Gly Ala Leu Ser Gly Ala Ser Val Ser Ser 
20 25 30 

aaa gtt gaa caa ctg cgt gcg cag tta aat gaa cgt att ctg gtg ctg 144 
Lys Val Glu Gin Leu Arg Ala Gin Leu Asn Glu Arg He Leu Val Leu 
35 40 45 



gac ggc ggt atg ggc acc atg ate cag age tat cgt eta cat gaa gaa 
Asp Gly Gly Met Gly Thr Met He Gin Ser Tyr Arg Leu His Glu Glu 
50 55 60 

gat ttc cgc ggg gag cgc ttt gcc gac tgg ccc tgc gac ctg aaa ggc 
Asp Phe Arg Gly Glu Arg Phe Ala Asp Trp Pro Cys Asp Leu Lys Gly 
65 70 .75 80 



192 



240 



aac aat gac ctg ctg gtc etc age aag ccg gag gtg ate gcc get ate 288 
Asn Asn Asp Leu Leu Val Leu Ser Lys Pro Glu Val He Ala Ala He 
85 90 95 

cac aac gcc tac ttt gag get ggc gcg gat ate ate gaa acc aac acc 336 
His Asn Ala Tyr Phe Glu Ala Gly Ala Asp He He Glu Thr Asn Thr 
100 105 no 

ttt aac teg aca acc att gcg atg gcg gat tac egg atg gaa tec ctg 384 
Phe Asn Ser Thr Thr He Ala Met Ala Asp Tyr Arg Met Glu Ser Leu 
115 120 125 

teg gcg gaa att aac tat gcg gcg gcc aaa ctg gcg cgc gcc tgc gcc 432 
Ser Ala Glu He Asn Tyr Ala Ala Ala Lys Leu Ala Arg Ala Cys Ala 
130 135 140 
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gat gaa tgg acg gcg cga aca cca gaa aaa cca cgc ttt gtt gcg ggc 480 
Asp Glu Trp Thr Ala Arg Thr Pro Glu Lys Pro Arg Phe Val Ala Gly 
145 150 155 160 

gt$ ctt ggt cca act aac cgc acg gcc tec att teg ccg gac gtc aac 528 
Val Leu Gly Pro Thr Asn Arg Thr Ala Ser He Ser Pro Asp Val Asn 
165 170 175 

gac ccg gcg ttt cgt aat ate ace ttc gat cag ctg gtg gcg gcc tac 576 
Asp Pro Ala Phe Arg Asn He Thr Phe. Asp Gin Leu Val Ala Ala Tyr 
180 185 190 

cgt gaa tec ace aaa gcg ctg gtg gaa ggc ggg gcg gac ctg ate ctg 624 
Arg Glu Ser Thr Lys Ala Leu Val Glu Gly Gly Ala Asp Leu He Leu 
19 5 200 205 



att gaa act gtc ttc gac ace etc aac gcc aaa gcg gcg gtg ttt gcg 
He Glu Thr Val Phe Asp Thr Leu Asn Ala Lys Ala Ala Val Phe Ala 
210 215 220 

gtg aaa gaa gag ttt gaa gcg ctg ggc gtt gat ctg ccg ate atg att 
Val Lys Glu Glu Phe Glu Ala Leu Gly Val Asp Leu Pro He Met He 
225 230 235 240 

tec ggc ace ate acc gac gcc tct ggc cgt acg ctt tec ggc cag acg 
Ser Gly Thr He Thr Asp Ala Ser Gly Arg Thr Leu Ser Gly Gin Thr 
245 250 255 

acc gaa gcc ttt tat aac teg ctg cgc cac gcc gag gcg etc act ttt 816 
Thr Glu Ala Phe Tyr Asn Ser Leu Arg His Ala Glu Ala Leu Thr Phe 
260 265 270 

ggc ctt aac tgc gcg ctg ggg cca gat gaa ctg cgc cag tac gtc cag 864 
Gly Leu Asn Cys Ala Leu Gly Pro Asp Glu Leu Arg Gin Tyr Val Gin 



275 280 



285 



672 



720 



768 



gaa ctg teg egg att gcc gaa tgc tac gtc acc gcg cac ccg aac gcc 912 
Glu Leu Ser Arg He Ala Glu Cys Tyr Val Thr Ala His Pro Asn Ala 
290 295 300 

ggc ctg ccg aac get ttc ggc gag tac gac etc gac gcc gac acc atg 960 
Gly Leu Pro Asn Ala Phe Gly Glu Tyr Asp Leu Asp Ala Asp Thr Met 
305 310 3i5 320 

gcg aaa cag att cgc gaa tgg gcg gaa gcg ggc ttc ctg aat ate gtt 1008 
Ala Lys Gin He Arg Glu Trp Ala Glu Ala Gly Phe Leu Asn He Val 
325 330 335 

ggc ggc tgc tgc ggc acc acg ccg gag cat att gcg gcg atg age cgc 1056 
Gly Gly Cys Cys Gly Thr Thr Pro Glu His He Ala Ala Met Ser Arg 
340 345 350 

gcc gtt gcc ggt ttg teg ccg cgc cag ctg ccg gat ate ccg gtg gcc 1104 
Ala Val Ala Gly Leu Ser Pro Arg Gin Leu Pro Asp He Pro Val Ala 
355 360 365 

tgc cgc ctt tec ggc ctg gag ccg ctg aac att ggt gac gat age ctg 1152 
Cys Arg Leu Ser Gly Leu Glu Pro Leu Asn He Gly Asp Asp Ser Leu 
370 375 380 

ttt gtc aac gtc ggc gaa cgt act aac gtc acc ggc teg gcc aaa ttt 1200 
Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser Ala Lys Phe 



( 
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3 85 390 395 400 

aaa cgc ttg ate aaa gaa gag aaa tac age gaa gcg ctg gat gtc gee 1248 
Lys Arg Leu lie Lys Glu Glu Lye Tyr Ser Glu Ala Leu Asp Val Ala 
405 410 415 

cgt cag cag gtc gaa age ggc gcg cag att att gat ate aat atg gat 1296 
Arg Gin Gin Val Glu Ser Gly Ala Gin He He Asp He Asn Met Asp 
420 425 430 

gag ggg atg etc gac gee gaa gcg gcg atg gtg cgt ttc etc age ctg 1344 
Glu Gly Met Leu Asp Ala Glu Ala Ala Met Val Arg Phe Leu Ser Leu 
435 440 445 

att gec ggt gag ccg gac att gec cgt gta cca ate atg att gac tec 1392 
He Ala Gly Glu Pro Asp He Ala Arg Val Pro He Met He Asp Ser 
450 455 460 

tec aaa tgg gag gtt ate gaa aaa ggg ctg aag tgc att cag ggt aaa 1440 
Ser Lys Trp Glu Val He Glu Lys Gly Leu Lys Cys He Gin Gly Lys 
465 470 475 480 

ggc ate gtc aac tct att teg atg aaa gag ggc gtg gaa gee ttt att 1488 
Gly He Val Asn Ser He Ser Met Lys Glu Gly Val Glu Ala Phe He 
485 490 495 

cat cat gcg aag ttg eta cgt cgc tac ggc gec gca gtg gtg gtg atg 1536 
His His Ala Lys Leu Leu Arg Arg Tyr Gly Ala Ala Val Val Val Met 
500 505 510 

get ttt gat gag cag ggg cag gee gac ace cgc gaa cgt aaa ate gag 1584 
Ala Phe Asp Glu Gin Gly Gin Ala Asp Thr Arg Glu Arg Lys He Glu 
515 520 525 

att tgc cgc cgc get tac aaa att ttg etc gaa gag gta ggc ttt ccg 1632 
He Cys Arg Arg Ala Tyr Lys lie Leu Leu Glu Glu Val Gly Phe Pro 
530 535 540 



ccg gaa gac ate ate ttc gac ccg aat ate ttc gee gtc gee ace ggt 
Pro Glu Asp He He Phe Asp Pro Asn He Phe Ala Val Ala Thr Gly 
545 550 555 560 



1680 



att gaa gag cac aac aac tac gcg cag gac ttt ate ggc get tgt gaa 1728 
He Glu Glu His Asn Asn Tyr Ala Gin Asp Phe He Gly Ala Cys Glu 
565 570 575 

gac ate aaa cgc gag ctg ccg cac gcg ctg ate tec ggc ggc gtg tct 1776 
Aep He Lys Arg Glu Leu Pro His Ala Leu He Ser Gly Gly Val Ser 
580 ' 585 590 

aac gtg tec ttc teg ttt cgc ggc aac gac ccg gta cgt gag get ate 1824 
Asn Val Ser Phe Ser Phe Arg Gly Asn Asp Pro Val Arg Glu Ala He 
595 600 605 

cac gcg gta ttc etc tac tac gee ate cgc aac ggc atg gac atg ggc 1872 
His Ala Val Phe Leu Tyr Tyr Ala He Arg Asn Gly Met Asp Met Gly 
610 615 620 

ate gtc aac gee ggg caa ctg gcg att tat gac aac ctg cct gee gaa 1920 
He Val Asn Ala Gly Gin Leu Ala He Tyr Asp Asn Leu Pro Ala Glu 
625 630 635 640 
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ctg cgc gat gca gtt gaa gat gtc att ctt aac cgt cgc gat gac ggc 1968 
Leu Arg Asp Ala Val Glu Asp Val lie Leu Asn Arg Arg Asp Asp Gly 
645 650 655 

acc' gag cgt ttg ctg gat ttg gcg gag aaa tat cgc ggc age aaa acc 2016 
Thr Glu Arg Leu Leu Asp Leu Ala Glu Lys Tyr Arg Gly Ser Lys Thr 
660 665 *" 670 

gac gag get gec agt gec cag cag gcg gaa tgg cgt age tgg gac gtg 2064 
Asp Glu Ala Ala Ser Ala Gin Gin Ala Glu Trp Arg Ser Trp Asp Val 
675 6B0 685 

aaa aag cgt etc gaa tac teg ctg gtg aaa ggc att acc gag ttt ate 2112 
Lys Lys Arg Leu Glu Tyr Ser Leu Val Lys Gly He Thr Glu Phe He 
690 695 700 

gaa cag gat acc gaa gaa gee cgt cag cag gee gee cgc ccg att gag 2160 
Glu Gin Asp Thr Glu Glu Ala Arg Gin Gin Ala Ala Arg Pro He Glu 
705 710 715 720 

gtg att gaa ggg ccg ctg atg gac ggc atg aac gtg gtc ggc gac ctg 2208 
Val He Glu Gly Pro Leu Met Asp Gly Met Asn Val Val Gly Asp Leu 
725 730 735 

ttc ggc gaa ggg aaa atg ttc ctg ccg cag gtg gtg aaa tec get cgc 2256 
Phe Gly Glu Gly Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala Arg 
740 745 750 

gtg atg aaa caa gcg gtg gec tac ctg gag ccg ttt att gaa gee age 2304 
Val Met Lys Gin Ala Val Ala Tyr Leu Glu Pro Phe He Glu Ala Ser 
755 760 765 

aaa gaa aaa ggc tec age aac ggc aag atg gtg att gee acc gtg aag 2352 
Lys Glu Lys Gly Ser Ser Asn Gly Lys Met Val He Ala Thr Val Lys 
770 775 780 

ggc gat gtg cac gac att ggc aag aac att gtc ggc gtg gtg ctg caa 2400 
Gly Asp Val His Asp lie Gly Lys Asn lie Val Gly Val Val Leu Gin 
785 790 795 800 

tgc aac aac tac gaa ate gtc gat ctt ggc gtg atg gtg cca gcg gag 2448 
Cys Asn Asn Tyr Glu He Val Asp Leu Gly Val Met Val Pro Ala Glu 
805 810 815 

aaa ate etc aga acg gcg cgt gaa gtg aat gee gat ctg att ggt ctt 2496 
Lys He Leu Arg Thr Ala Arg Glu Val Asn Ala Asp Leu He Gly Leu 
820 825 830 

tec ggg ctt ate acc ccg teg ctg gac gaa atg gtc aac gtg gcg aaa 2544 
Ser Gly Leu He Thr Pro Ser Leu Asp Glu Met Val Asn Val Ala Lys 
835 840 845 

gag atg gag cgt cag ggc ttt act ate ccg eta ctg ate ggc ggc gca 2592 
Glu Met Glu Arg Gin Gly Phe Thr He Pro Leu Leu He Gly Gly Ala 
850 855 860 

acc act teg aaa gcg cat acg gcg gtg aaa ate gag cag aac tac age 2640 
Thr Thr Ser Lys Ala His Thr Ala Val Lys He Glu Gin Asn Tyr Ser 
865 870 875 880 

ggt ccg acg gtc tac gtg cag aat get teg cgt acc gtg ggc gtg gtg 2688 
Gly Pro Thr Val Tyr Val Gin Asn Ala Ser Arg Thr Val Gly Val Val 
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885 



101 



890 



895 



gcg gcg eta etc tec gac ace cag cgt gat gac ttt gtc gee cgt acc 
Ala Ala Leu Leu Ser Asp Thr Gin Arg Asp Asp Phe Val Ala Arg Thr 
900 905 910 



2736 



cgc aaa gag tac gaa acc gtg cgt att cag cac gec cgc aaa aaa ccg 
Arg Lys Glu Tyr Glu Thr Val Arg He Gin His Ala Arg Lys Lys Pro 
915 920 925 



2784 



cgc acg ccg ccg gtc acg ctg gaa gcg gcg cgc gat aat gat ctg gca 
Arg Thr Pro Pro Val Thr Leu Glu Ala Ala Arg Asp Asn Asp Leu Ala 
930 935 940 



2832 



ttt gat tgg gaa cgc tac acc ccg ccg gta gec cac cgt ctg ggc gtg 2880 
Phe Asp Trp Glu Arg Tyr Thr Pro Pro Val Ala His Arg Leu Gly Val 
945 950 955 960 

cag gag gtg gaa gec age ate gaa acg ctg cgc aac tac ate gac tgg 2928 
Gin Glu Val Glu Ala Ser He Glu Thr Leu Arg Asn Tyr He Asp Trp 
965 970 975 



acg ccg ttc ttt atg acc tgg teg ctg gee ggc aaa tac ccg cgc att 
Thr Pro Phe Phe Met Thr Trp Ser Leu Ala Gly Lys Tyr Pro Arg He 
980 985 990 



2976 



ctg gaa gat gag gtg gtg ggc gtt gag gcg cag cgt ctg ttt aaa gac 
Leu Glu Asp Glu Val Val Gly Val Glu Ala Gin Arg Leu Phe Lys Asp 
995 1000 1005 



3024 



gec aat gat atg ctg gat aaa ctg age gec gag aaa ctg ttg aat ccg 3072 
Ala Asn Asp Met Leu Asp Lys Leu Ser Ala Glu Lys Leu Leu Asn Pro 
1010 1015 1020 

cgt ggc gtg gtg ggc ctg ttc ccg gcg aac cgt gtg ggt gac gac ate 3120 
Arg Gly Val Val Gly Leu Phe Pro Ala Asn Arg Val Gly Asp Asp lie 
10 25 1030 1035 1040 

gaa ate tat cgc gac gaa acc cgt act cat gtt ctg acg gtc age cac 3168 
Glu He Tyr Arg Asp Glu Thr Arg Thr His Val Leu Thr Val Ser His 
1045 1050 1055 

cac ctg cgc cag cag acc gag aaa gtt ggt ttt get aac tac tgt ctg 3216 
His Leu Arg Gin Gin Thr Glu Lys Val Gly Phe Ala Asn Tyr Cys Leu 
1060 1065 1070 

gcg gat ttt gtc gcg ccg aaa ctg age ggc aaa gcg gac tac ate ggt 3264 
Ala Asp Phe Val Ala Pro Lys Leu Ser Gly LyB Ala Asp Tyr He Gly 
1075 1080 1085 



get ttc gcg gtg acc ggc ggt ctg aag gag gat gcg ctg gcg gac gec 
Ala Phe Ala Val Thr Gly Gly Leu Lys Glu Asp Ala Leu Ala Asp Ala 
1090 1095 lioo 



3312 



ttc gaa gcg caa cac gac gac tat aac aag ate atg gtg aaa gcg att 3360 
Phe Glu Ala Gin His Asp Asp Tyr Asn Lys He Met Val Lys Ala He 
1 1°5 1110 ins H20 

gee gac cgt ctg gcg gaa gcg ttt gee gag tat ctg cat gag cgt gta 3408 
Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu Tyr Leu HiB Glu Arg Val 
1125 1130 H35 
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cgt aag gtt tac tgg gga tat gcg ccg aac gag age ctg agt aac gac 3456 
Arg Lys Val Tyr Trp Gly Tyr Ala Pro Asn Glu Ser Leu Ser Asn Asp 
1140 1145 1150 

gaa' tta ate cgc gaa aac tac cag ggg att cgc ccg gcg ccg ggt tat 3504 
Glu Leu He Arg Glu Asn Tyr Gin Gly He Arg Pro Ala Pro Gly Tyr 
1155 1160 1165 

cct gec tgc ccg gaa cat acc gaa aaa ggc act ate tgg cag eta ctg 3552 
Pro Ala Cys Pro Glu His Thr Glu Lys Gly Thr lie Trp Gin Leu Leu 
1170 1175 1180 

gat gtc gaa aaa cac acc ggg atg aag etc acc gaa tct etc gec atg 3600 
Asp Val Glu Lye His Thr Gly Met Lys Leu Thr Glu Ser Phe Ala Met 
H85 1190 1195 1200 

tgg cct ggc gcg teg gtc tec ggc tgg tac ttc age cat cct gag age 3648 
Trp Pro Gly Ala Ser Val Ser Gly Trp Tyr Phe Ser His Pro Glu Ser 
1205 1210 1215 

aaa tac ttc gcg gta gcg cag ate caa cgc gat cag gtg aca gat tat 3696 
Lys Tyr Phe Ala Val Ala Gin He Gin Arg Asp Gin Val Thr Asp Tyr 
1220 1225 1230 

get ttc cgt aaa gga atg. age gtt gag gac gtt gag egg tgg etc gcg 3744 
Ala Phe Arg Lys Gly Met Ser Val Glu Asp Val Glu Arg Trp Leu Ala 
1235 1240 1245 

ccg aac ctg ggt tac gat gcg gac tga 3771 
Pro Asn Leu Gly Tyr Asp Ala Asp 
1250 1255 



<210> 24 
<211> 1256 
<212> PRT 

<213> Salmonella typhi 
<400> 24 

Met Ser His Val Ala Arg Cys Ser Leu Phe Arg Gin His Ala Leu Cys 
1 5 10 15 

Gin Tyr Gly Ser Leu Arg Gly Ala Leu Ser Gly Ala Ser Val Ser Ser 
20 25 30 

Lys Val Glu Gin Leu Arg Ala Gin Leu Asn Glu Arg He Leu Val Leu 
35 40 ~ 45 

Asp Gly Gly Met Gly Thr Met He Gin Ser Tyr Arg Leu His Glu Glu 
50 55 60 

Asp Phe Arg Gly Glu Arg Phe Ala Asp Trp Pro Cys Asp Leu Lys Gly 
65 70 75 80 

Asn Asn Asp Leu Leu Val Leu Ser Lys Pro Glu Val He Ala Ala He 
65 90 95 

His Asn Ala Tyr Phe Glu Ala Gly Ala Asp He He Glu Thr Asn Thr 
100 105 no 

Phe Asn Ser Thr Thr He Ala Met Ala Asp Tyr Arg Met Glu Ser Leu 
115 120 125 



I 
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Ser Ala Glu lie Asn Tyr Ala Ala Ala Lys Leu Ala Arg Ala Cya Ala 
130 135 140 

Asp Glu Trp Thr Ala Arg Thr Pro Glu Lys Pro Arg Phe Val Ala Gly 
145 150 155 160 

Val Leu Gly Pro Thr Asn Arg Thr Ala Ser He Ser Pro Asp Val Asn 
165 170 175 

Asp Pro Ala Phe Arg Asn He Thr Phe Asp Gin Leu Val Ala Ala Tyr 
180 185 190 

Arg Glu Ser Thr Lye Ala Leu Val Glu Gly Gly Ala Asp Leu He Leu 
195 200 205 

He Glu Thr Val Phe Asp Thr Leu Aim Ala Lys Ala Ala Val Phe Ala 
210 215 220 

Val Lys Glu Glu Phe Glu Ala Leu Gly Val Asp Leu Pro He Met He 
225 230 235 240 

Ser Gly Thr He Thr Asp Ala Ser Gly Arg Thr Leu Ser Gly Gin Thr 
245 250 255 

Thr Glu Ala Phe Tyr Asn Ser Leu Arg His Ala Glu Ala Leu Thr Phe 
260 265 270 

Gly Leu Asn Cys Ala Leu Gly Pro Asp Glu Leu Arg Gin Tyr Val Gin 
275 280 285 

Glu Leu Ser Arg He Ala Glu Cys Tyr Val Thr Ala His Pro Asn Ala 
290 295 300 

Gly Leu Pro Asn Ala Phe Gly Glu Tyr Asp Leu Asp Ala Asp Thr Met 
305 310 315 320 

Ala Lys Gin He Arg Glu Trp Ala Glu Ala Gly Phe Leu Asn He Val 
325 330 335 

Gly Gly Cys Cys Gly Thr Thr Pro Glu His He Ala Ala Met Ser Arg 
340 345 350 

Ala Val Ala Gly Leu Ser Pro Arg Gin Leu Pro Asp He Pro Val Ala 
355 360 365 

Cye Arg Leu Ser Gly Leu Glu Pro Leu Asn He Gly Asp Asp Ser Leu 
370 375 380 

Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser Ala Lys Phe 
385 390 395 400 

Lys Arg Leu He Lys Glu Glu Lys Tyr Ser Glu Ala Leu Asp Val Ala 
405 410 415 

Arg Gin Gin Val Glu Ser Gly Ala Gin He He Asp He Asn Met Asp 
420 425 430 

Glu Gly Met Leu Asp Ala Glu Ala Ala Met Val Arg Phe Leu Ser Leu 
435 440 445 

He Ala Gly Glu Pro Asp He Ala Arg Val Pro He Met He Asp Ser 
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450 455 460 

Ser Lys Trp Glu Val He Glu Lys Gly Leu Lys Cys He Gin Gly Lys 
465 470 475 480 

Gly He Val Asn Ser He Ser Met Lys Glu Gly Val Glu Ala P^e He 
485 490 495 

His His Ala Lys Leu Leu Arg Arg Tyr Gly Ala Ala Val Val Val Met 
500 505 510 

Ala Phe Asp Glu Gin Gly Gin Ala Asp Thr Arg Glu Arg Lys He Glu 
515 520 525 

He Cys Arg Arg Ala Tyr Lys He Leu Leu Glu Glu Val Gly Phe Pro 
530 535 540 

Pro Glu Asp He He Phe Asp Pro Asn He Phe Ala Val Ala Thr Gly 
545 550 555 560 

He Glu Glu His Asn Asn Tyr Ala Gin Asp Phe He Gly Ala Cys Glu 
565 570 * 575 

Asp He Lys Arg Glu Leu Pro His Ala Leu He Ser Gly Gly Val Ser 
560 585 590 

Asn Val Ser Phe Ser Phe Arg Gly Asn Asp Pro Val Arg Glu Ala He 
595 600 605 

His Ala Val Phe Leu Tyr Tyr Ala He Arg Asn Gly Met Ab P Met Gly 
610 615 620 

He Val Asn Ala Gly Gin Leu Ala He Tyr Asp Asn Leu Pro Ala Glu 
625 630 635 640 

Leu Arg Asp Ala Val Glu Asp Val He Leu Asn Arg Arg Asp Asp Gly 
64 5 650 655 

Thr Glu Arg Leu Leu Asp Leu Ala Glu Lys Tyr Arg Gly Ser Lys Thr 
660 665 670 

Asp Glu Ala Ala Ser Ala Gin Gin Ala Glu Trp Arg Ser Trp Asp Val 
67 5 680 685 

Lys Lys Arg Leu Glu Tyr Ser Leu Val Lys Gly He Thr Glu Phe He 
690 695 700 

Glu Gin Asp Thr Glu Glu Ala Arg Gin Gin Ala Ala Arg Pro He Glu 
705 710 715 720 

Val He Glu Gly Pro Leu Met Asp Gly Met Asn Val Val Gly Asp Leu 
725 730 735 

Phe Gly Glu Gly Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala Arg 
74 ° 745 750 

Val Met Lys Gin Ala Val Ala Tyr Leu Glu Pro Phe He Glu Ala Ser 
7 55 760 765 

Lys Glu Lys Gly Ser Ser Asn Gly Lys Met Val He Ala Thr Val Lys 
77 0 775 780 
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Gly Asp Val His Asp He Qly Lys Asn He Val Gly Val Val Leu Gin 
785 790 795 800 

Cys Asn Asn Tyr Glu He Val Asp Leu Gly Val Met Val Pro Ala Glu 
80S 810 815 

Lys He Leu Arg Thr Ala Arg Glu Val Asn Ala Asp Leu He Gly Leu 
820 825 830 

Ser Gly Leu He Thr Pro Ser Leu Asp Glu Met Val Asn Val Ala Lys 
835 840 845 

Glu Met Glu Arg Gin Gly Phe Thr He Pro Leu Leu He Gly Gly Ala 
850 855 860 

Thr Thr Ser Lys Ala His Thr Ala Val Lys He Glu Gin Asn Tyr Ser 
865 870 875 880 

Gly Pro Thr Val Tyr Val Gin Asn Ala Ser Arg Thr Val Gly Val Val 
885 890 895 

Ala . Ala Leu Leu Ser Asp Thr Gin Arg Asp Asp Phe Val Ala Arg Thr 
900 905 9io 

Arg Lys Glu Tyr Glu Thr Val Arg He Gin His Ala Arg Lys Lys Pro 
915 920 925 

Arg Thr Pro Pro Val Thr Leu Glu Ala Ala Arg Ab P Asn Asp Leu Ala 
930 935 940 

Phe Asp Trp Glu Arg Tyr Thr Pro Pro Val Ala His Arg Leu Gly Val 
945 950 955 960 

Gin Glu Val Glu Ala Ser He Glu Thr Leu Arg Asn Tyr He Asp Trp 
965 970 975 

Thr Pro Phe Phe Met Thr Trp Ser Leu Ala Gly Lys Tyr Pro Arg He 
980 985 990 

Leu Glu Asp Glu Val Val Gly Val Glu Ala Gin Arg Leu Phe Lys Asp 
995 1000 1005 

Ala Asn Asp Met Leu Asp Lys Leu Ser Ala Glu Lys Leu Leu Asn Pro 
!010 1015 1020 

Arg Gly Val Val Gly Leu Phe Pro Ala Asn Arg Val Gly Asp Asp He 
1025 1030 1035 1040 

Glu He Tyr Arg Asp Glu Thr Arg Thr His Val Leu Thr Val Ser His 
1045 1050 1055 

His Leu Arg Gin Gin Thr Glu Lys Val Gly Phe Ala Asn Tyr Cys Leu 
1060 1065 1070 

Ala Asp Phe Val Ala Pro Lys Leu Ser Gly Lys Ala Asp Tyr He Gly 
1075 1080 1085 

Ala Phe Ala Val Thr Gly Gly Leu Lys Glu Asp Ala Leu Ala Asp Ala 
1090 1095 iioo 

Phe Glu Ala Gin His Asp Asp Tyr Asn Lys He Met Val Lys Ala He 
1105 mo ins 1120 
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Ala Asp Arg Leu Ala Olu Ala Phe Ala Glu Tyr Leu His Glu Arg Val 
1125 1130 H35 

Arg' Lys Val Tyr Trp Gly Tyr Ala Pro Asn Glu Ser Leu Ser' Asn Asp 
1140 H45 H50 

Glu Leu He Arg Glu Asn Tyr Gin Gly He Arg Pro Ala Pro Gly Tyr 
1155 1160 H65 

Pro Ala Cys Pro Glu His Thr Glu Lys Gly Thr He Trp Gin Leu Leu 
1170 H75 1180 

Asp Val Glu Lys His Thr Gly Met Lys Leu Thr Glu Ser Phe Ala Met 
H85 1190 H95 1200 

Trp Pro Gly Ala Ser Val Ser Gly Trp Tyr Phe Ser His Pro Glu Ser 
1205 1210 1215 

Lys Tyr Phe Ala Val Ala Gin He Gin Arg Asp Gin Val Thr Asp Tyr 
1220 1225 1230 

Ala Phe Arg Lys Gly Met Ser Val Glu Asp Val Glu Arg Trp Leu Ala 
1235 1240 1245 

Pro Asn Leu Gly Tyr Asp Ala Asp 
1250 1255 



<210> 25 
<211> 3711 
<212> DKA 

<213> Pseudomonas f lucres cens 

<220> 
<221> CDS 
<222> (1) . . (370B) 
<223> RPU03563 

<400> 25 

atg tec gat cgc age gtc 

Met Ser Asp Arg Ser Val 

1 " 5 

gag cgc ate ctg att etc 
Glu Arg He Leu He Leu 
20 

tac aag etc gaa gag cag 
Tyr Lys Leu Glu Glu Gin 
35 

ccg age gac gtc aag ggc 
Pro Ser Asp Val Lys Gly 
50 

gac gtg ate ggc ggc ate 
Asp Val He Gly Gly He 
65 70 

ate etc gag ace aac acc 
He Leu Glu Thr Asn Thr 



cgc ctt caa get etc aag caa get etc aaa 48 
Arg Leu Gin Ala Leu Lys Gin Ala Leu Lys 
10 15 

gac ggc ggc atg ggc acg atg ate cag age 96 
Asp Gly Gly Met Gly Thr Met He Gin Ser 
25 30 

gat tat cgc ggc aaa cgc ttc gee gac tgg 144 
Asp Tyr Arg Gly Lys Arg Phe Ala Asp Trp 
40 45 

aac aac gac ctg ttg gtg ctg acc cgc ccg 192 
Asn Asn Asp Leu Leu Val Leu Thr Arg Pro 
55 60 

gag aaa gee tat ctg gat gee ggt gee gac 240 
Glu Lys Ala Tyr Leu Asp Ala Gly Ala Asp 
75 80 

ttc aac gee acg cag att tec atg gee gac 286 
Phe Asn Ala Thr Gin He Ser Met Ala Asp 



{ 
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85 90 95 

tac ggc atg gaa gaa ctg gtc tac gaa etc aac gta gaa ggc gec cgt 336 
Tyr Gly Met Glu Glu Leu Val Tyr Glu Leu Asn Val Glu Gly Ala Arg 
100 105 110 

ctg gca cgc aag gtc gec gac gcg aaa acc etc gag acc ccc gac aag 384 
Leu Ala Arg Lys Val Ala Asp Ala Lys Thr Leu Glu Thr Pro Asp Lys 
115 120 125 

ccg cgc ttc gtc gec ggc gtt etc ggc ccg acc age cgc acc tgc teg 432 
Pro Arg Phe Val Ala Gly Val Leu Gly Pro Thr Ser Arg Thr Cys Ser 
130 135 140 

ctg teg ccg gac gtc aac aac ccg ggc tat cgc aac gtc acc ttc gat 480 
Leu Ser Pro Asp Val Asn Asn Pro Gly Tyr Arg Asn Val Thr Phe Asp 
145 150 155 160 

gag ctg gtc gaa aac tac acc gag gee acc aaa ggc ctg ate gag ggc 528 
Glu Leu Val Glu Asn Tyr Thr Glu Ala Thr Lys Gly Leu He Glu Gly 
165 170 175 

ggc gcg gat ctg ate ctg ate gaa acc ate ttc gac acc etc aac gec 576 
Gly Ala Asp Leu He Leu He Glu Thr lie Phe Asp Thr Leu Asn Ala 
180 185 190 

aaa gee gcg ate ttc gee gtg caa ggc gtg ttc gaa gaa ctg ggc ttc 624 
Lys Ala Ala He Phe Ala Val Gin Gly Val Phe Glu Glu Leu Gly Phe 
195 200 205 

gaa ttg ccg ate atg ate tec ggc acc ate acc gac gec tec ggc cgt 672 
Glu Leu Pro He Met He Ser Gly Thr He Thr Asp Ala Ser Gly Arg 
210 215 220 

acc ctg teg ggc cag acc acc gaa gcg ttc tgg aac tec gtg get cac 720 
Thr Leu Ser Gly Gin Thr Thr Glu Ala Phe Trp Asn Ser Val Ala His 
225 230 235 240 

gec aaa ccg att tec gtc ggt ctt aac tgc gee etc ggc gee cgc gaa 768 
Ala Lys Pro He Ser Val Gly Leu Asn Cys Ala Leu Gly Ala Arg Glu 
245 250 255 

ctg cgt ccg tac ctg gaa gag ctg teg gac aag gee age acc cac gtt 816 
Leu Arg Pro Tyr Leu Glu Glu Leu Ser Asp Lys Ala Ser Thr His Val 
260 265 270 

teg gcg cac ccg aac gee ggc ctg ccg aac gaa ttc ggc gag tac gac 864 
Ser Ala His Pro Asn Ala Gly Leu Pro Asn Glu Phe Gly Glu Tyr Asp 
275 280 285 

gag ctg ccg gtg gac acc gee aag gtc ate gaa gag ttc gee cag age 912 
Glu Leu Pro Val Asp Thr Ala Lys Val He Glu Glu Phe Ala Gin Ser 
290 295 300 

ggt ttc etc aac ate gtc ggc ggt tgc tgc ggc acc acg ccg ggc cat 960 
Gly Phe Leu Asn He Val Gly Gly Cys Cys Gly Thr Thr Pro Gly His 
305 310 315 320 

ate gaa gec ate gee aaa gee gtt gee ggt tac gcg cca egg cag att 1008 
He Glu Ala He Ala Lys Ala Val Ala Gly Tyr Ala Pro Arg Gin He 
325 330 335 
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ccg gac att ccc aag gcc tgc cgc ctg teg ggt ctg gaa ccg ttc acc 1056 
Pro Asp He Pro Lys Ala Cys Arg Leu Ser Gly Leu Glu Pro. Phe Thr 
340 345 350 

att' gat cgc age teg ctg ttc gtc aac gtc ggc gag egg acc aac ate 1104 
He Asp Arg Ser Ser Leu Phe Val Asn Val Gly Glu Arg Thr Aan He 
3S5 360 365 

acc ggg tec gcg aaa ttt gcc egg ctg ate cgt gaa gac aac tac acc 1152 
Thr Gly Ser Ala Lys Phe Ala Arg Leu He Arg Glu Asp Asn Tyr Thr 
370 375 380 

gaa gcc ctg gaa gtc gcc ctg cag cag gtc gag gcc ggc gcc cag gtg 1200 
Glu Ala Leu Glu Val Ala Leu Gin Gin Val Glu Ala Gly Ala Gin Val 
385 390 395 400 

ate gac ate aac atg gac gaa ggg atg etc gat teg aag aag gcc atg 1248 
He Asp He Asn Met Asp Glu Gly Met Leu Asp Ser Lys Lys Ala Met 
405 410 415 

gtg acc ttc etc aat ctg att gcc ggc gaa ccg gac ate tec cgc gta 1296 
Val Thr Phe Leu Asn Leu He Ala Gly Glu Pro Asp He Ser Arg Val 
420 425 430 

ccg ate atg ate gac tec teg aaa tgg gac gtg ate gaa gcc ggc etc 1344 
Pro lie Met He Asp Ser Ser LyB Trp Asp Val He Glu Ala Gly Leu 
435 440 445 

aag tgc att cag ggc aag ggc ate gtc aac teg ate age atg aaa gaa 1392 
Lys Cys He Gin Gly Lys Gly He Val Asn Ser He Ser Met Lys Glu 
450 45S 460 

ggc gtc gag cag ttc ate cac cac gcc aaa ctg tgc aag cgc tat ggc 1440 
Gly Val Glu Gin Phe He His His Ala Lys Leu Cys Lys Arg Tyr Gly 
465 470 475 480 

gcc gcc gtg gtg gtg atg gcg ttc gac gaa gcc ggc cag get gac acc 1488 
Ala Ala Val Val Val Met Ala Phe Asp Glu Ala Gly Gin Ala Asp Thr 
485 490 495 

gaa gcg cgc aag aaa gag ate tgc aaa cgc tec tac gac att ctg gtc 1536 
Glu Ala Arg Lys Lys Glu He Cys Lys Arg Ser Tyr Asp He Leu Val 
500 505 510 

aac gaa gtc ggc ttc ccg ccg gaa gac ate att ttc gac ccg aac ate 1584 
Asn Glu Val Gly Phe Pro Pro Glu Asp He He Phe Asp Pro Asn He 
515 520 525 

ttc gcc gtg gcc acc ggc ate gaa gaa cac aac aac tac get gtg gac 1632 
Phe Ala Val Ala Thr Gly He Glu Glu His Asn Asn Tyr Ala Val Asp 
530 535 540 

ttc ate aac gcc tgt gcc tac ate cgc gac gag ctg ccg tat gcc ctg 1680 
Phe He Asn Ala Cys Ala Tyr He Arg Asp Glu Leu Pro Tyr Ala Leu 
5 « 550 555 560 

age tec ggc ggc gtg tec aac gtg teg ttc teg ttc cgc ggc aac aac 1728 
Ser Ser Gly Gly Val Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Asn 
565 570 575 

ccg gtg cgc gag gcg ate cac teg gtg ttc ctg ctg tac gcg ate cgc 1776 
Pro Val Arg Glu Ala He His Ser Val Phe Leu Leu Tyr Ala He Arg 



WO 03/087386 PCT/EP03/04010 

109 

580 505 590 

gcc ggc ctg acc atg ggt ate gtc aac gec ggt cag ctg gag ate tac 1824 
Ala Gly Leu Thr Met Gly lie Val Asn Ala Gly Gin Leu Glu He Tyr 
595 600 60S 

gac cag ate ccg cag gaa ctg cgc gac gcc gtt gaa gac gtg ate etc 1872 
Asp Gin He Pro Gin Glu Leu Arg Asp Ala Val Glu Asp Val He Leu 
610 615 620 

aac cgc acg ccg gaa ggc acc gac gcc etc etc gcc ate gcc gac aag 1920 
Asn Arg Thr Pro Glu Gly Thr Asp Ala Leu Leu Ala He Ala Asp Lys 
"5 630 635 640 

tac aag ggc gac ggc age gtc aag gaa gcc gag acc gaa gaa tgg cgc 1968 
Tyr Lys Gly Asp Gly Ser Val Lys Glu Ala Glu Thr Glu Glu Trp Arg 
645 650 655 

ggc tgg gac gtc aac aaa cgt ctg gaa cat gcg ctg gtc aag ggc ate 2016 
Gly Trp Asp Val Asn Lys Arg Leu Glu His Ala Leu Val Lys Gly He 
660 665 670 

acc acc cac ate gtc gaa gac acc gaa gaa tec cgt cag tec ttc gcc 2064 
Thr Thr His He Val Glu Asp Thr Glu Glu Ser Arg Gin Ser Phe Ala 
675 680 685 

cgc ccg ate gaa gtg ate gaa ggc ccg ctg atg tec ggc atg aac ate 2112 
Arg Pro He Glu Val He Glu Gly Pro Leu Met Ser Gly Met Asn He 
690 695 700 

gtc ggc gac ctg ttc ggc gcc ggc aaa atg ttc ctg ccg caa gtg gtg 2160 
Val Gly Asp Leu Phe Gly Ala Gly Lys Met Phe Leu Pro Gin Val Val 
70S 710 715 720 

aaa tec gcc cgc gtg atg aag cag gcc gtg gcg cac ctg att ccg ttc 2208 
Lys Ser Ala Arg Val Met Lys Gin Ala Val Ala His Leu He Pro Phe 
725 730 735 

ate gaa ctg gaa aaa ggc gac aag ccg gaa gcc aag ggc aag ate ctg 2256 
He Glu Leu Glu Lys Gly Asp LyB Pro Glu Ala Lys Gly Lys He Leu 
740 745 750 

atg gcc acg gtc aaa ggc gac gtg cac gac ate ggc aag aac ate gtc 2304 
Met Ala Thr Val Lys Gly Asp Val His Asp He Gly Lys Asn He Val 
755 760 765 

BSC gtg gtg ctg ggt tgc aac ggc tac gac ate gtc gac etc ggc gtg 2352 
Gly Val Val Leu Gly Cys Asn Gly Tyr Asp He Val Asp Leu Gly Val 
770 775 780 

atg gtg ccg gcg gag aag ate ctg cag gtg gcc aag gag cag aag tgc 2400 
Met Val Pro Ala Glu Lys He Leu Gin Val Ala Lys Glu Gin Lys Cys 
7 *5 790 795 800 

gac ate ate ggc ctg tec ggt ctg ate acc ccg teg ctg gat gag atg 2448 
Asp He He Gly Leu Ser Gly Leu He Thr Pro Ser Leu Asp Glu Met 
80S 810 815 

gtc cat gtg gcc cgc gag atg cag cgc cag gac ttc cac ctg ccg ctg 2496 
Val His Val Ala Arg Glu Met Gin Arg Gin Asp Phe His Leu Pro Leu 
820 825 830 
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atg ate ggc ggc gcg acc acc tec aag gcg cac acg gcg gtg aag ate 2544 
Met He Gly Gly Ala Thr Thr Ser Lye Ala His Thr Ala Val Lys He 
835 840 845 

gag ccc aag tac age aac gac gca gtg gtc tac gtg acc gac gec tec 2592 
Glu Pro Lys Tyr Ser Asn Asp Ala Val Val Tyr Val Thr Asp Ala Ser 
850 855 660 

cgc gee gtg ggc gtg gcg acg cag ttg ctg tec aag gaa ctg aaa gee 2640 
Arg Ala Val Gly Val Ala Thr Gin Leu Leu Ser Lys Glu Leu Lys Ala 
865 870 875 860 

ggt ttc gtc cag aag acc cgc gaa gag tac ate gac gtc cgc gag cgc 2686 
Gly Phe Val Gin Lys Thr Arg Glu Glu Tyr He Asp Val Arg Glu Arg 
885 890 895 

acc gee aac cgc age gee cgc acc gaa cgc ctg age tac gec gec gcg 2736 
Thr Ala Asn Arg Ser Ala Arg Thr Glu Arg Leu Ser Tyr Ala Ala Ala 
900 905 910 

ate gec aag aag ccg cag ttc gac tgg gee act tac acc ccg gtc aaa 2784 
He Ala Lys Lys Pro Gin Phe Asp Trp Ala Thr Tyr Thr Pro Val Lys 
915 920 925 

ccg acc ttc acc ggc acc cgc gtg ctg gac aac ate gac etc aac gtt 2832 
Pro Thr Phe Thr Gly Thr Arg Val Leu Asp Asn He Asp Leu Asn Val 
930 935 940 

etc gee gag tac ate gac tgg acg ccg ttc ttc ate tec tgg' gac ctg 2880 
Leu Ala Glu Tyr He Asp Trp Thr Pro Phe Phe He Ser Trp Asp Leu 
945 950 955 960 

gec ggc aag ttc ccg cgc ate etc gaa gac gaa gtg gtc ggc gaa gcg 2928 
Ala Gly Lys Phe Pro Arg He Leu Glu Asp Glu Val Val Gly Glu Ala 
965 970 975 

gcg acc gcg ctg tac aag gac get cgc gag atg ctg acc aag ctg ate 2976 
Ala Thr Ala Leu Tyr Lys Asp Ala Arg Glu Met Leu Thr Lys Leu He 
980 985 990 

gac gag aaa ctg ate age gec cgt gcg gtg ttc ggc ttc tgg ccg gee 3024 
Asp Glu Lys Leu He Ser Ala Arg Ala Val Phe Gly Phe Trp Pro Ala 
995 iooo 1005 

aat cag gtg cac gac gac gat ate gag ctg tac ggc gat gac ggc aag 3072 
Asn Gin Val His Asp Asp Asp He Glu Leu Tyr Gly Asp Asp Gly Lys 
1010 1015 1020 

cca atg gcg cgc ctg cat cac ctg cgc cag cag ate ate aag acc gac 3120 
Pro Met Ala Arg Leu His His Leu Arg Gin Gin He He Lys Thr Asp 
1025 1030 1035 ' 1040 

ggc aaa ccg aac ttc tec etc gee gac ttc gtc gcg ccg aag gac age 3168 
Gly Lys Pro Asn Phe Ser Leu Ala Asp Phe Val Ala Pro Lys Asp Ser 
1045 1050 1055 

gaa gtg acc gac tac gtt ggt ggt ttc ate acc acc gee ggg ate ggc 3216 
Glu Val Thr Asp Tyr Val Gly Gly Phe He Thr Thr Ala Gly He Gly 
1060 1065 1070 

H** 2? a 9t ? 9CC 339 9CC tat cag 9 ac 9 CC 99° 9ac gat tac aac 3264 
Ala Glu Glu Val Ala Lys Ala Tyr Gin Asp Ala Gly Asp Asp Tyr Asn 
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1075 1080 1085 

teg ate atg gtc aag gee ctg gee gac cgt ctg gee gag gcg tgc gee 3312 
Ser He Met Val Lys Ala Leu Ala Asp Arg Leu Ala Qlu Ala Cy B Ala 
1090 1095 iioo 

gag tgg ctg cac cag cag gtg cgc aaa gag cac tgg ggt tac gec aag 3360 
Glu Trp Leu His Gin Gin Val Arg Lys Glu His Trp Gly Tyr Ala Lys 
1105 1110 ins xizo 

gat gaa gec etc gat aac gag gcg ctg ate aaa gag cag tat tec ggc 3408 
Asp Glu Ala Leu Asp Asn Glu Ala Leu He Lys Glu Gin Tyr Ser Gly 
H25 , mo H35 

ate cgc cct gec ccc ggc tac ccg gcg tgc ccg gat cac acc gag aag 3456 
He Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Asp His Thr Glu Lys 
1140 H45 H50 

gec acc ctg ttc gec ctg etc gac cct gaa gca cag gaa atg cgc gec 3504 
Ala Thr Leu Phe Ala Leu Leu Asp Pro Glu Ala Gin Glu Met Arg Ala 
1155 H60 H65 

ggc cgc age ggt gtg ttc etc acc gag cac tac gcg atg ttc ccg gcg 3552 
Gly Arg Ser Gly Val Phe Leu Thr Glu His Tyr Ala Met Phe Pro Ala 
H70 H75 1180 

gca gec gtc age ggc tgg tac ttc gec cat ccg cag gcg cag tac ttc 3600 
Ala Ala Val Ser Gly Trp Tyr Phe Ala His Pro Gin Ala Gin Tyr Phe 
H fi 5 1190 1195 1200 

gec gtg ggc aag gtc gac aag gat cag gtg cag age tac acc teg cgc 3648 
Ala val Gly Lys Val Asp Lys Asp Gin Val Gin Ser Tyr Thr Ser Arg 
1205 1210 1215 

aaa ggc cag gaa ctg age ctg acc gag cgc tgg ctg gca ccc aat ctg 3696 
Lys Gly Gin Glu Leu Ser Leu Thr Glu Arg Trp Leu Ala Pro Asn Leu 
1220 1225 1230 

ggc tac gac aac tga 3711 
Gly Tyr Asp Asn 
1235 



<210> 26 
<211> 1236 
<212> PRT 

<213> Pseudomonas fluorescens 
<400> 26 

Met Ser Asp Arg Ser Val Arg Leu Gin Ala Leu Lys Gin Ala Leu Lys 
15 io 15 

Glu Arg lie Leu He Leu Asp Gly Gly Met Gly Thr Met He Gin Ser 
20 25 30 

Tyr Lys Leu Glu Glu Gin Asp Tyr Arg Gly Lys Arg Phe Ala Asp Trp 
35 40 45 

Pro Ser Asp Val Lys Gly Asn Asn Asp Leu Leu Val Leu Thr Arg Pro 
50 55 60 

Asp Val lie Gly Gly He Glu Lys Ala Tyr Leu Asp Ala Gly Ala Asp 
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65 70 75 80 

He Leu Glu Thr Asn Thr Phe Asn Ala Thr Gin He Ser Met Ala Asp 
85 90 ,95 

Tyr Gly Met Glu Glu Leu Val Tyr Glu Leu Asn Val Glu Gly Ala Arg 
100 105 no 

Leu Ala Arg Lys Val Ala Asp Ala Lys Thr Leu Glu Thr Pro Asp Lys 
115 120 125 

Pro Arg Phe Val Ala Gly Val Leu Gly Pro Thr Ser Arg Thr Cys Ser 
130 135 140 

Leu Ser Pro Asp Val Asn Asn Pro Gly Tyr Arg Asn Val Thr Phe Asp 
145 150 155 160 

Glu Leu Val Glu Asn Tyr Thr Glu Ala Thr Lys Gly Leu He Glu Gly 
165 170 175 

Gly Ala Asp Leu He Leu lie Glu Thr He Phe Asp Thr Leu Asn Ala 
180 185 iso 

Lys Ala Ala He Phe Ala Val Gin Gly Val Phe Glu Glu Leu Gly Phe 
195 200 205 

Glu Leu Pro He Met He Ser Gly Thr He Thr Asp Ala Ser Gly Arg 
210 215 220 

Thr Leu Ser Gly Gin Thr Thr Glu Ala Phe Trp Asn Ser Val Ala His 
225 230 235 240 

Ala Lys Pro He Ser Val Gly Leu Asn Cys Ala Leu Gly Ala Arg Glu 
245 250 ^ 255 

Leu Arg Pro Tyr Leu Glu Glu Leu Ser Asp Lys Ala Ser Thr His Val 
260 265 270 

Ser Ala His Pro Asn Ala Gly Leu Pro Asn Glu Phe Gly Glu Tyr Asp 
275 280 285 

Glu Leu Pro Val Asp Thr Ala Lys Val He Glu Glu Phe Ala Gin Ser 
290 295 300 

Gly Phe Leu Asn He Val Gly Gly Cys Cys Gly Thr Thr Pro Gly His 
305 310 315 320 

He Glu Ala He Ala Lys Ala Val Ala Gly Tyr Ala Pro Arg Gin He 
325 330 335 

Pro Asp He Pro Lys Ala Cys Arg Leu Ser Gly Leu Glu Pro Phe Thr 
340 345 350 

He Asp Arg Ser Ser Leu Phe Val Asn Val Gly Glu Arg Thr Asn He 
355 360 365 

Thr Gly Ser Ala Lys Phe Ala Arg Leu He Arg Glu Asp Asn Tyr Thr 
370 375 380 

Glu Ala Leu Glu Val Ala Leu Gin Gin Val Glu Ala Gly Ala Gin Val 
385 390 395 * 400 
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lie Asp He Asn Met Asp Glu Gly Met Leu Asp Ser Lye Lys Ala Met 
405 410 415 

Val Thr Phe Leu Asn Leu lie Ala Gly Glu Pro Asp He Ser Arg Val 
420 425 430 

Pro He Met He Asp Ser Ser Lys Trp Asp Val He Glu Ala Gly Leu 
435 440 445 

Lys Cys He Gin Gly Lys Gly He Val Asn Ser He Ser Met Lys Glu 
450 455 460 

Gly Val Glu Gin Phe He His His Ala Lys Leu Cys Lys Arg Tyr Gly 
465 470 475 480 

Ala Ala Val Val Val Met Ala Phe Asp Glu Ala Gly Gin Ala Asp Thr 
485 490 495 

Glu Ala Arg Lys Lys Glu He Cys Lys Arg Ser Tyr Asp He Leu Val 
500 505 510 

Asn Glu Val Gly Phe Pro Pro Glu Asp He He Phe Asp Pro Asn He 
515 520 525 

Phe Ala Val Ala Thr Gly He Glu Glu Hie Asn Asn Tyr Ala Val Asp 
530 535 540 

Phe He Asn Ala Cys Ala Tyr He Arg Asp Glu Leu Pro Tyr Ala Leu 
545 550 555 * 560 

Ser Ser Gly Gly Val Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Asn 
565 570 575 

Pro Val Arg Glu Ala He His Ser Val Phe Leu Leu Tyr Ala He Arg 
580 585 590 

Ala Gly Leu Thr Met Gly He Val Asn Ala Gly Gin Leu Glu He Tyr 
595 600 605 

Asp Gin lie Pro Gin Glu Leu Arg Asp Ala Val Glu Asp Val He Leu 
610 615 620 

Asn Arg Thr Pro Glu Gly Thr Asp Ala Leu Leu Ala He Ala Asp Lys 
625 630 635 640 

Tyr Lys Gly Asp Gly Ser Val Lys Glu Ala Glu Thr Glu Glu Trp Arg 
645 650 655 

Gly Trp Asp Val Asn Lys Arg Leu Glu His Ala Leu Val Lys Gly lie 
660 665 670 

Thr Thr His He Val Glu Asp Thr Glu Glu Ser Arg Gin Ser Phe Ala 
675 680 685 

Arg Pro He Glu Val lie Glu Gly Pro Leu Met Ser Gly Met Asn lie 
690 695 700 

Val Gly Asp Leu Phe Gly Ala Gly Lys Met Phe Leu Pro Gin Val Val 
7 °5 710 715 720 

Lys Ser Ala Arg Val Met Lys Gin Ala Val Ala His Leu lie Pro Phe 
725 730 735 
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He Glu Leu Glu Lye Gly Asp Lys Pro Glu Ala Lys Gly Lys He Leu 
740 745 750 

Met' Ala Thr Val Lys Gly Asp Val His Asp He Gly Lys Asn' lie Val 
755 760 765 

Gly Val Val Leu Gly Cys Asn Gly Tyr Asp He Val Asp Leu Gly Val 
770 775 780 

Met Val Pro Ala Glu Lys He Leu Gin Val Ala Lys Glu Gin Lys Cys 
7 *5 790 795 800 

Asp He He Gly Leu Ser Gly Leu He Thr Pro Ser Leu Asp Glu Met 
805 810 815 

Val His Val Ala Arg Glu Met Gin Arg Gin Asp Phe His Leu Pro Leu 
820 825 830 

Met He Gly Gly Ala Thr Thr Ser Lys Ala His Thr Ala Val Lys He 
835 840 845 

Glu Pro Lys Tyr Ser Asn Asp Ala Val Val Tyr Val Thr Asp Ala Ser 
850 8S5 860 

Arg Ala Val Gly Val Ala Thr Gin Leu Leu Ser Lys Glu Leu Lys Ala 
8^5 870 875 880 

Gly Phe Val Gin Lys Thr Arg Glu Glu Tyr He Asp Val Arg Glu Arg 
885 890 895 

Thr Ala Asn Arg Ser Ala Arg Thr Glu Arg Leu Ser Tyr Ala Ala Ala 
900 905 910 

He Ala Lys Lys Pro Gin Phe Asp Trp Ala Thr Tyr Thr Pro Val Lys 
915 920 925 

Pro Thr Phe Thr Gly Thr Arg Val Leu Asp Asn He Asp Leu Asn Val 
930 935 940 

Leu Ala Glu Tyr He Asp Trp Thr Pro Phe Phe He Ser Trp Asp Leu 
945 950 955 960 

Ala Gly Lys Phe Pro Arg He Leu Glu Asp Glu Val Val Gly Glu Ala 
965 970 975 

Ala Thr Ala Leu Tyr Lys Asp Ala Arg Glu Met Leu Thr Lys Leu He 
980 985 990 

Asp Glu Lys Leu He Ser Ala Arg Ala Val Phe Gly Phe Trp Pro Ala 
995 1000 1005 

Asn Gin Val His Asp Asp Asp He Glu Leu Tyr Gly Asp Asp Gly Lys 
1010 1015 1020 

Pro Met Ala Arg Leu His His Leu Arg Gin Gin He He Lys Thr Asp 
1025 1030 1035 1040 

Gly Lys Pro Asn Phe Ser Leu Ala Asp Phe Val Ala Pro Lys Asp Ser 
1045 1050 1055 

Glu Val Thr Asp Tyr Val Gly Gly Phe He Thr Thr Ala Gly lie Gly 
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1060 1065 1070 

Ala Glu Glu Val Ala Ly B Ala Tyr Gin Asp Ala Gly Asp Asp Tyr Asn 
1075 1080 1085 

Ser lie Met Val Lys Ala Leu Ala Asp Arg Leu Ala Glu Ala Cys Ala 
1090 1095 1100 

Glu Trp Leu His Gin Gin Val Arg Lys Glu His Trp Gly Tyr Ala Lys 
H°5 1110 ins 1120 

Asp Glu Ala Leu Asp Asn Glu Ala Leu He Lys Glu Gin Tyr Ser Gly 
1125 H30 1135 

He Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Asp His Thr Glu Lys 
1140 H45 1150 

Ala Thr Leu Phe Ala Leu Leu Asp Pro Glu Ala Gin Glu Met Arg Ala 
H55 H60 H65 

Gly Arg Ser Gly Val Phe Leu Thr Glu His Tyr Ala Met Phe Pro Ala 
H70 H75 1180 

Ala Ala Val Ser Gly Trp Tyr Phe Ala His Pro Gin Ala Gin Tyr Phe 
1190 H95 1200 

Ala Val Gly Lys Val Asp Lys Asp Gin Val Gin Ser Tyr Thr Ser Arg 
1205 1210 1215 

Lys Gly Gin Glu Leu Ser Leu Thr Glu Arg Trp Leu Ala Pro Asn Leu 
1220 1225 1230 

Gly Tyr Asp Asn 
1235 



<210> 27 
<211> 3705 
<212> DNA 

<213> Pseudomona8 aeruginosa 

<220> 
<221> CDS 
<222> (1)..(3702) 
<223> RPA01772 

<400> 27 

atg tec age ccg etc acc gat cgc age gee cgc ctg caa gec etc cag 48 
Met Ser Ser Pro Leu Thr Asp Arg Ser Ala Arg Leu Gin Ala Leu Gin 
1 5 io 15 



cac gee etc agg gaa cgt ate ctg ate etc gat ggc ggc atg ggc acc 
His Ala Leu Arg Glu Arg He Leu He Leu Asp Gly Gly Met Gly Thr 
20 25 * 30 



96 



atg ate cag age tac aag ctg gaa gag gee gac tac cgc ggc gag cgc 144 
Met He Gin Ser Tyr Lys Leu Glu Glu Ala Asp Tyr Arg Gly Glu Arg 
35 40 45 



ttc gee gac tgg ccg age gac gtg aaa ggc aac aac gac etc ttg ctg 
Phe Ala Asp Trp Pro Ser Asp Val Lys Gly A B n Asn Asp Leu Leu Leu 
50 55 60 



192 
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ctg age cgc ccg gac gtg ate cag gec ate gag aag gec tac etc gac 240 
Leu Ser Arg Pro Asp Val lie Gin Ala He Glu Lye Ala Tyr Leu Asp 
« 70 75 . \ f 0 



gec ggc gee gac ate etc gag ace aac ace ttc aac gee ace cag gtg 
Ala Gly Ala Asp He Leu Glu Thr Asn Thr Phe Asn Ala Thr Gin Val 
85 so 95 



288 



tec cag gee gac tac ggc atg cag teg ctg gec tac gaa etc aac gtc 336 
Ser Gin Ala Asp Tyr Gly Met Gin Ser Leu Ala Tyr Glu Leu Asn Val 
100 105 no 

gaa ggg gcg cgc ctg gec cgc cag gtg gcg gac gcg aag acc gec gag 384 
Glu Gly Ala Arg Leu Ala Arg Gin Val Ala Asp Ala Lys Thr Ala Glu 
115 120 125 

acc ccg gac aag ccg cgt ttc gtc gee ggc gtg etc ggc ccg acc age 432 
Thr Pro Asp Lys Pro Arg Phe Val Ala Gly Val Leu Gly Pro Thr Ser 
130 135 140 

cgc acc tgc teg att tec ccg gac gtg aac aac ccc ggc tac cgc aac 480 
Arg Thr Cys Ser He Ser Pro Asp Val Asn Asn Pro Gly Tyr Arg Asn 
14S 150 i 55 160 

gtc acc ttc gac gaa ctg gtg gag aac tac gtc gag gcg acc cga ggc 528 
Val Thr Phe Asp Glu Leu Val Glu Asn Tyr Val Glu Ala Thr Arg Gly 
165 170 175 

ctg ate gaa ggc ggc gee gac ctg ate ctg ate gag acc ate ttc gac 576 
Leu He Glu Gly Gly Ala Asp Leu He Leu He Glu Thr He Phe Asp 
180 185 190 

acc etc aac gec aag gcg gcg ate ttc gec gtc cag ggc gtg ttc gag 624 
Thr Leu Asn Ala Lys Ala Ala He Phe Ala Val Gin Gly Val Phe Glu 
135 200 205 

gaa etc ggc gtg gag ctg ccg ate atg ate tec gga acc ate acc gac 672 
Glu Leu Gly Val Glu Leu Pro He Met He Ser Gly Thr He Thr Asp 
210 215 220 

gec tec ggc cgc acc ctg teg ggc cag acc acc gag gec ttc tgg aac 720 
Ala Ser Gly Arg Thr Leu Ser Gly Gin Thr Thr Glu Ala Phe Trp Asn 
225 230 235 240 

teg gtg egg cat gee egg ccg ate teg gta ggc ctg aac tgc gee etc 768 
Ser Val Arg His Ala Arg Pro He Ser Val Gly Leu Asn Cys Ala Leu 
24 5 250 255 

ggc gee aag gaa ttg egg ccg tac ate gag gaa ctg teg acc aag gee 816 
Gly Ala Lys Glu Leu Arg Pro Tyr He Glu Glu Leu Ser Thr Lys Ala 
260 265 270 

gac act cat gtc teg gec cac ccc aac gee ggc ctg ccg aac gec ttc 864 
Asp Thr His Val Ser Ala His Pro Asn Ala Gly Leu Pro Asn Ala Phe 
275 280 285 

ggc gaa tac gac gaa teg ccg gcg gaa atg gee gtg gtg gtc gag gaa 912 
Gly Glu Tyr Asp Glu Ser Pro Ala Glu Met Ala Val Val Val Glu Glu 
290 295 300 



ttc gec gec gee ggc ttc etc aat ate gtc ggc ggc tgc tgc ggc acc 



960 



f 
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Phe Ala Ala Ala Gly Phe Leu Asn lie Val Gly Gly Cys Cys Gly Thr 
3 °5 310 315 320 

acc ccg gcg cac ate gag gcg ate gee aag gca gtg gee aag tac ccg 1008 
Thr Pro Ala His He Glu Ala He Ala Lys Ala Val Ala Lys Tyr Pro 
325 330 335 

ccg egg gee ate ccg gag att ccc egg gee tgt cgc ctg tec ggc ctg 1056 
Pro Arg Ala He Pro Glu He Pro Arg Ala Cys Arg Leu Ser Gly Leu 
340 345 350 

gag ccg ttc acc ate gac cgc age teg ctg ttc gtc aac gtc ggc gag 1104 
Glu Pro Phe Thr He Asp Arg Ser Ser Leu Phe Val Asn Val Gly Glu 
355 360 365 

cgc acc aac ate acc ggt teg gee aag ttc gee egg ctg ate cgc gag 1152 
Arg Thr Asn He Thr Gly Ser Ala Lys Phe Ala Arg Leu He Arg Glu 
370 375 380 

gaa aac tac gcg gaa get etc gag gtc gec cag cag cag gtg gaa gee 1200 
Glu Asn Tyr Ala Glu Ala Leu Glu Val Ala Gin Gin Gin Val Glu Ala 
385 390 395 400 

ggc gee cag gtg ate gac ate aac atg gac gaa ggc atg ctg gac teg 1248 
Gly Ala Gin Val He Asp He Asn Met Asp Glu Gly Met Leu Asp Ser 
405 410 415 

aag gcg gee atg gtc acc ttc etc aac ctg ate gee tec gag ccc gac 1296 
Lys Ala Ala Met Val Thr Phe Leu Asn Leu He Ala Ser Glu Pro Asp 
420 425 430 

ate teg cgc gtg ccg ate atg ate gac tec tec aag tgg gaa gtg ate 1344 
He Ser Arg Val Pro He Met He Asp Ser Ser Lys Trp Glu Val He 
435 440 445 

gag gee ggc ctg aag tgc ate cag ggc aag ggc ate gtc aac teg ate 1392 
Glu Ala Gly Leu Lys Cys He Gin Gly Lys Gly He Val Asn Ser lie 
450 455 460 

teg atg aag gaa ggc gtc gag gee ttc aag cac cat gee cgc ctg tgc 1440 
Ser Met Lys Glu Gly Val Glu Ala Phe Lys His His Ala Arg Leu Cys 
465 470 475 480 

aag cgc tac ggc gee gcg gtg gtg gtg atg gee ttc gac gag gac ggc 1488 
Lys Arg Tyr Gly Ala Ala Val Val Val Met Ala Phe Asp Glu Asp Gly 
485 490 495 

cag gee gac acc cag gcg cgc aag gaa gaa ate tgc aag cgc tec tac 1536 
Gin Ala Asp Thr Gin Ala Arg Lys Glu Glu He Cys Lys Arg Ser Tyr 
500 505 * 510 

gac ate ctg gtc gac gaa gtc ggc ttc cca ccg gaa gac ate ate ttc 1584 
Asp He Leu Val Asp Glu Val Gly Phe Pro Pro Glu Asp He He Phe 
515 520 525 

gat gcg aac ate ttc gee ate gee acc ggc ate gag gaa cac aac aac 1632 
Asp Ala ABn He Phe Ala He Ala Thr Gly He Glu Glu His Asn Asn 
530 535 540 



tac gcg gtc gat ttc ate aac gee tgc gee tac ate cgc gac aac etc 
Tyr Ala Val Asp Phe He Asn Ala Cys Ala Tyr He Arg Asp Asn Leu 
545 550 555 5 60 



1680 
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ccc tac gcc ctg age teg ggc ggg gtg tec aac gtg tec ttc. teg ttc 1728 
Pro Tyr Ala Leu Ser Ser Gly Gly Val Ser Asn Val Ser Phe Ser Phe 
565 570 SIS 

cgc ggc aac aac ccg gta cgc gag gcg ate cac teg gtg ttc etc tac 1776 
Arg Gly Asn Asn Pro Val Arg Glu Ala He His Ser Val Phe Leu Tyr 
580 585 590 

tac gcg ate cgc aac ggc ctg ace atg ggc ate gtc aac gcc ggc cag 1824 
Tyr Ala He Arg Asn Gly Leu Thr Met Gly He Val Asn Alfet Gly Gin 
595 600 60S 

ctg gaa ate tac gac gag att ccg aaa gcg ctg cgc gac egg gtc gag 1872 
Leu Glu He Tyr Asp Glu He Pro Lys Ala Leu Arg Asp Arg Val Glu 
610 615 620 

gac gtg gtg etc aac cgc acg ccc gag gcc ace gag gcc ctg ctg gcg 1920 
Asp Val Val Leu Asn Arg Thr Pro Glu Ala Thr Glu Ala Leu Leu Ala 
«5 630 635 640 

ate gcc gac gac tac aag ggc ggc ggc gcg gtc aag gag gcc gag gac 1968 
He Ala Asp Asp Tyr Lys Gly Gly Gly Ala Val Lys Glu Ala Glu Asp 
645 650 655 

gag gaa tgg cgc age tac age gtc gag aag cgc etc gag cat gcg ctg 2016 
Glu Glu Trp Arg Ser Tyr Ser Val Glu Lys Arg Leu Glu His Ala Leu 
660 665 670 

gtc aag ggc ate acc acc tgg ate gtc gag gac acc gag gaa tgc cgc 2064 
Val Lys Gly He Thr Thr Trp He Val Glu Asp Thr Glu Glu Cys Arg 
675 680 685 

cag cag tgt gcg cgt ccc ate gag gtc ate gaa ggt ccg ctg atg tec 2112 
Gin Gin Cys Ala Arg Pro He Glu Val He Glu Gly Pro Leu Met Ser 
690 695 700 

ggg atg aac gtg gtc ggc gac ctg ttc ggc gcc ggc aag atg ttc etc 2160 
Gly Met Asn Val Val Gly Asp Leu Phe Gly Ala Gly Lys Met Phe Leu 
705 710 715 720 

ccg cag gtg gtc aag tec gcg cga gtg atg aag cag gcg gtg gcc cac 2208 
Pro Gin Val Val Lys Ser Ala Arg Val Met Lys Gin Ala Val Ala His 
725 730 735 

ctg att ccc ttc ate gag gcg gag aaa ggc gac aag ccg gaa gcc aag 2256 
Leu He Pro Phe He Glu Ala Glu Lys Gly Asp Lys Pro Glu Ala Lys 
740 745 750 

ggc aag ate ctg atg gcc acg gtg aag ggc gac gtg cac gac ate ggc 2304 
Gly Lys He Leu Met Ala Thr Val Lys Gly Asp Val His Asp He Gly 
755 760 765 

aag aac ate gtc ggc gtg gtg etc ggc tgc aac ggc tat gac gtg gtc 2352 
Lys Asn He Val Gly Val Val Leu Gly Cys Asn Gly Tyr Asp Val Val 
770 775 780 

gac etc ggc gtg atg gtg ccg gcg gag aag ate ctg cag acc gcc ate 2400 
Asp Leu Gly Val Met Val Pro Ala Glu Lys He Leu Gin Thr Ala He 
785 790 795 800 

gcc gag aaa tgc gac ate ate ggc ctg tct ggc ctg ate acg ccg teg 244 8 



WO 03/087386 
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Ala Qlu Lys Cys Asp He He Gly Leu Ser Gly Leu He Thr Pro Ser 
805 810 815 

ctg gac gag atg gtc cac gtc gcc aag gaa atg cag egg cag aat ttc 2496 
Leu Asp Glu Met Val His Val Ala Lys Glu Met Gin Arg Gin Asn Phe 
820 825 830 

cag ttg ccg ctg atg ate ggc ggc gcc act ace teg aag gcg cat acc 2544 
Gin Leu Pro Leu Met He Gly Gly Ala Thr Thr Ser Lys Ala His Thr 
835 840 645 

gcg gtg aag ate gat ccg cag tac age aac gac gcg gtg gtc tac gtc 2592 
Ala Val Lys He Asp Pro Gin Tyr Ser Asn Asp Ala Val Val Tyr Val 
850 855 860 

acc gac gcc teg cgc gcg gta ggc gtg gcc acc age ctg ctg tec aag 2640 
Thr Asp Ala Ser Arg Ala Val Gly Val Ala Thr Ser Leu Leu Ser Lys 
865 870 875 880 

gag ctg aag gcc gac tac gtg gcc cgc acc cgc gcc gac tac gcg gtg 2688 
Glu Leu Lys Ala Asp Tyr Val Ala Arg Thr Arg Ala Asp Tyr Ala Val 
885 890 895 

gtc cgc gaa cgc acg gcc aac cgc age gcc cgc acc gag egg ctg age 2736 
Val Arg Glu Arg Thr Ala Asn Arg Ser Ala Arg Thr Glu Arg Leu Ser 
500 90S 910 

tac gaa cag gcg ate gcc aac aag ccg gcg ttc gac tgg gcc ggc tac 2784 
Tyr Glu Gin Ala He Ala Asn Lys Pro Ala Phe Asp Trp Ala Gly Tyr 
915 920 925 

cag gcg ccg acg cct tec ttc acc ggc gtc agg gtg etc gac gag ate 2832 
Gin Ala Pro Thr Pro Ser Phe Thr Gly Val Arg Val Leu Asp Glu He 
930 935 940 

gac etc gcg gtg etc gcc gag tac ate gac tgg acg ccg ttc ttc att 2880 
Asp Leu Ala Val Leu Ala Glu Tyr He Asp Trp Thr Pro Phe Phe He 
945 950 955 960 

tec tgg gac ctg gcc ggc aag tac ccg cgc ate etc acc gac gag gtg 2928 
Ser Trp Asp Leu Ala Gly Lys Tyr Pro Arg He Leu Thr Asp Glu Val 
965 970 975 

gtc ggc gag gcc gcc acc teg ttg ttc aac gac gcc cag gcg atg ctg 2976 
Val Gly Glu Ala Ala Thr Ser Leu Phe Asn Asp Ala Gin Ala Met Leu 
980 985 990 

aag aag ctg ate gac gag aag ctg ate aag gcc cgc gcg gtg ttc ggc 3024 
Lys Lys Leu He Asp Glu Lys Leu He Lys Ala Arg Ala Val Phe Gly 
995 1000 1005 

ttc tgg ccg gcc aac cag gtc gag cac gac gac ctg gag gtc tac ggc 3072 
Phe Trp Pro Ala Asn Gin Val Glu His Asp Asp Leu Glu Val Tyr Gly 
1010 1015 1020 

gcc gat ggc gag acc etc gcc acc ctg cac cac ctg egg cag cag acg 3120 
Ala Asp Gly Glu Thr Leu Ala Thr Leu His His Leu Arg Gin Gin Thr 
1025 1030 1035 1040 

ate aag ccg gac ggc aag ccg aac ctg teg ctg gcc gat ttc gtc gcg 3168 
He Lys Pro Asp Gly Lys Pro Asn Leu Ser Leu Ala Asp Phe Val Ala 
1045 1050 1055 
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ccg aag gaa age ggc gtg cgc gac tac ate ggc ggc ttc ate acc acc 3216 
Pro Lys Glu Ser Gly Val Arg Asp Tyr He Gly Gly Phe He Thr Thr 

1060 1065 .1070 

, * 

gcc ggg ate ggc gcc gag gaa gtg gcc aag gcg tac gaa gcc aag ggc 3264 
Ala Gly He Gly Ala Glu Glu Val Ala Lys Ala Tyr Glu Ala Lys Gly 
1075 1080 1085 

gac gac tac aac age ate atg gtc aag gcg etc gcc gac cgc etc gcc 3312 
Asp Asp Tyr Asn Ser He Met Val Lys Ala Leu Ala Asp Arg Leu Ala 
1090 1095 lioo 



gaa gcc tgc gcc gag tgg ctg cac gag egg gtg cgc aag gag tac tgg 
Glu Ala Cys Ala Glu Trp Leu His Glu Arg Val Arg Lys Glu Tyr Tro 
1105 mo ins 



3360 



1120 



ggc tac gcc cgc gac gaa cac etc gac aac gag gcc ttg ate aag gag 3408 
Gly Tyr Ala Arg Asp Glu His Leu Asp Asn Glu Ala Leu He Lys Glu 
1125 U30 H35 

caa tac gtc ggc ate cgc ccg gca ccg ggc tac ccg gcc tgc ccc gac 3456 
Gin Tyr Val Gly He Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Asp 
1140 H45 1150 

cat acc gag aaa ggc act ctg ttc gaa ctg etc gat ccg cag ggc ctg 3504 
His Thr Glu Lys Gly Thr Leu Phe Glu Leu Leu Asp Pro Gin Gly Leu 
1155 H60 H65 

tec ggc gtc age ctg acc gag cac tac gcg atg ttc ccg gcc gcg gcg 3552 
Ser Gly Val Ser Leu Thr Glu His Tyr Ala Met Phe Pro Ala Ala Ala 
1170 H75 1180 

gtc age ggt tgg tat ttc gcc cac ccg cag gcg cag tac ttc gcg gtc 3600 
Val Ser Gly Trp Tyr Phe Ala His Pro Gin Ala Gin Tyr Phe Ala Val 
1185 H90 1195 1200 

ggc aag ate gac aag gac cag gtg gaa cgc tac age cag cgc aag ggc 3648 
Gly Lys He Asp Lys Asp Gin Val Glu Arg Tyr Ser Gin Arg Lys Gly 
1205 1210 1215 

cag gaa gcc age gtc age gag cgc tgg ctg gcg ccg aac ctt ggc tac 3696 
Gin Glu Ala Ser Val Ser Glu Arg Trp Leu Ala Pro Asn Leu Gly Tyr 
1220 1225 1230 

gat gac tga 370S 
Asp Asp 



<210> 28 
<211> 1234 
<2\2> PRT 

<213> Pseudomonas aeruginosa 
<400> 26 

Met Ser Ser Pro Leu Thr Asp Arg Ser Ala Arg Leu Gin Ala Leu Gin 
15 io 15 



His Ala Leu Arg olu Arg He Leu He Leu Asp Gly Gly Met Gly Thr 
20 25 30 
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Met lie Gin Ser Tyr Lys Leu Qlu Glu Ala Asp Tyr Arg Gly Glu Arg 
35 40 45 

Phe Ala Asp Trp Pro Ser Asp Val Lys Oly Asn Asn Asp Leu Leu Leu 
50 55 60 

Leu Ser Arg Pro Asp Val lie Gin Ala lie Glu Lye Ala Tyr Leu Asp 
65 70 75 80 

Ala Gly Ala Asp lie Leu Glu Thr Asn Thr Phe Asn Ala Thr Gin Val 
85 90 95 

Ser Gin Ala Asp Tyr Gly Met Gin Ser Leu Ala Tyr Glu Leu Asn Val 
100 105 no 

Glu Gly Ala Arg Leu Ala Arg Gin Val Ala Asp Ala Lys Thr Ala Glu 
115 120 125 

Thr Pro Asp Lys Pro Arg Phe Val Ala Gly Val Leu Gly Pro Thr Ser 
130 135 140 

Arg Thr Cys Ser He Ser Pro Asp Val Asn Asn Pro Gly Tyr Arg Asn 
145 150 155 160 

Val Thr Phe Asp Glu Leu Val Glu Asn Tyr Val Glu Ala Thr Arg Gly 
165 170 175 

Leu He Glu Gly Gly Ala Asp Leu lie Leu He Glu Thr He Phe Abp 
180 185 190 

Thr Leu Asn Ala Lys Ala Ala He Phe Ala Val Gin Gly Val Phe Glu 
155 200 205 

Glu Leu Gly Val Glu Leu Pro He Met He Ser Gly Thr He Thr Asp 
210 215 220 

Ala Ser Gly Arg Thr Leu Ser Gly Gin Thr Thr Glu Ala Phe Trp Asn 
225 230 235 240 

Ser Val Arg His Ala Arg Pro He Ser Val Gly Leu Asn Cys Ala Leu 
245 250 255 

Gly Ala Lys Glu Leu Arg Pro Tyr He Glu Glu Leu Ser Thr Lys Ala 
260 265 270 

Asp Thr His Val Ser Ala His Pro Asn Ala Gly Leu Pro Asn Ala Phe 
275 280 285 

Gly Glu Tyr Asp Glu Ser Pro Ala Glu Met Ala Val Val Val Glu Glu 
290 295 300 

Phe Ala Ala Ala Gly Phe Leu Asn He Val Gly Gly Cys Cys Gly Thr 
305 310 315 ' 320 

Thr Pro Ala His He Glu Ala He Ala Lys Ala Val Ala Lys Tyr Pro 
325 330 335 

Pro Arg Ala lie Pro Glu He Pro Arg Ala Cys Arg Leu Ser Gly Leu 
340 345 " 350 

Glu Pro Phe Thr He Asp Arg Ser Ser Leu Phe Val Asn Val Gly Glu 
355 360 365 
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Arg Thr Asn He Thr Gly Ser Ala Lye Phe Ala Arg Leu He. Arg Glu 
370 375 380 

Glu' Asn Tyr Ala Glu Ala Leu Glu Val Ala Gin Gin Gin Val Glu Ala 
385 390 395 , 400 

Gly Ala Gin Val He Asp He Asn Met Asp Glu Gly Met Leu Asp Ser 
405 410 * 415 

Lys Ala Ala Met Val Thr Phe Leu Asn Leu He Ala Ser Glu Pro Asp 
420 425 430 

He Ser Arg Val Pro He Met He Asp Ser Ser Lys Tip Glu Val He 
435 440 445 

Glu Ala Gly Leu Lys Cys He Gin Gly Lys Gly He Val Asn Ser He 
450 455 460 

Ser Met Lys Glu Gly Val Glu Ala Phe Lys His His Ala Arg Leu Cys 
465 470 475 480 

Lys Arg Tyr Gly Ala Ala Val Val Val Met Ala Phe Asp Glu Asp Gly 
485 490 495 

Gin Ala Asp Thr Gin Ala Arg Lys Glu Glu He Cys Lys Arg Ser Tyr 
500 505 " 510 

Asp He Leu Val Asp Glu Val Gly Phe Pro Pro Glu Asp lie lie Phe 
515 520 525 

Asp Ala Asn He Phe Ala He Ala Thr Gly He Glu Glu His Asn Asn 
530 535 540 

Tyr Ala Val Asp Phe He Asn Ala Cys Ala Tyr He Arg Asp Asn Leu 
545 550 555 560 

Pro Tyr Ala Leu Ser Ser Gly Gly Val Ser Asn Val Ser Phe Ser Phe 
565 570 575 

Arg Gly Asn Asn Pro Val Arg Glu Ala lie His Ser Val Phe Leu Tyr 
580 585 590 

Tyr Ala He Arg Asn Gly Leu Thr Met Gly He Val Asn Ala Gly Gin 
595 600 605 

Leu Glu He Tyr Asp Glu He Pro Lys Ala Leu Arg Asp Arg Val Glu 
610 615 620 

Asp Val Val Leu Asn Arg Thr Pro Glu Ala Thr Glu Ala Leu Leu Ala 
625 630 635 640 

He Ala Asp Asp Tyr Lys Gly Gly Gly Ala Val Lys Glu Ala Glu Asp 
645 650 655 

Glu Glu Trp Arg Ser Tyr Ser Val Glu Lys Arg Leu Glu His Ala Leu 
660 665 670 

Val Lys Gly He Thr Thr Trp He Val Glu Asp Thr Glu Glu Cys Arg 
675 680 685 

Gin Gin Cys Ala Arg Pro He Glu Val lie Glu Gly Pro Leu Met Ser 
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690 



695 



700 



Gly Met Asn Val Val Gly Asp Leu Phe Gly Ala Gly Lys Met Phe Leu 
70S 710 715 720 

Pro Gin Val Val Lya Ser Ala Arg Val Met Lye Gin Ala Val Ala Hie 
725 730 735 

Leu lie Pro Phe He Glu Ala Glu Lys Gly Asp Lys Pro Glu Ala Lys 
740 745 750 

Gly Lys He Leu Met Ala Thr Val Lys Gly Asp Val His Asp He Gly 
755 760 765 

Lys Asn He Val Gly Val Val Leu Gly Cys Asn Gly Tyr Asp Val Val 
770 775 780 

Asp Leu Gly Val Met Val Pro Ala Glu Lys He Leu Gin Thr Ala He 
7B5 790 795 800 

Ala Glu Lys Cys Asp He He Gly Leu Ser Gly Leu He Thr Pro Ser 
80S 810 815 

Leu Asp Glu Met Val His Val Ala Lys Glu Met Gin Arg Gin Asn Phe 
820 825 830 

Gin Leu Pro Leu Met He Gly Gly Ala Thr Thr Ser Lys Ala His Thr 
835 840 845 

Ala Val Lys He Asp Pro Gin Tyr Ser Asn Asp Ala Val Val Tyr Val 
850 855 860 

Thr Asp Ala Ser Arg Ala Val Gly Val Ala Thr Ser Leu Leu Ser Lys 
865 870 875 880 

Glu Leu Lys Ala Asp Tyr Val Ala Arg Thr Arg Ala Asp Tyr Ala Val 
885 890 895 

Val Arg Glu Arg Thr Ala Asn Arg Ser Ala Arg Thr Glu Arg Leu Ser 
900 905 910 

Tyr Glu Gin Ala He Ala Asn Lys Pro Ala Phe Asp Trp Ala Gly Tyr 
915 920 925 

Gin Ala Pro Thr Pro Ser Phe Thr Gly Val Arg Val Leu Asp Glu He 
930 935 940 

Asp Leu Ala Val Leu Ala Glu Tyr He Abp Trp Thr Pro Phe Phe He 
945 950 955 960 

Ser Trp Asp Leu Ala Gly Lys Tyr Pro Arg He Leu Thr Asp Glu Val 
965 970 975 

Val Gly Glu Ala Ala Thr Ser Leu Phe Asn Asp Ala Gin Ala Met Leu 
980 985 990 

Lys Lys Leu He Asp Glu Lys Leu He Lys Ala Arg Ala Val Phe Gly 
995 1000 1005 

Phe Trp Pro Ala Asn Gin Val Glu His Asp Asp Leu Glu Val Tyr Gly 



1010 



1015 



1020 
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Ala Asp Oly Glu Thr Leu Ala Thr Leu His His Leu Arg Oln Gin Thr 
1025 1030 10 35 1040 

lie Lys Pro Asp Gly Lye Pro Asn Leu Ser Leu Ala Asp Phe Val Ala 
1045 1050 ~ 1055 

Pro Lys Glu Ser Gly Val Arg Asp Tyr He Gly Gly Phe He Thr Thr 
1060 1065 * 1070 

Ala Gly He Gly Ala Glu Glu Val Ala Lys Ala Tyr Glu Ala Lys Gly 
1075 1080 1085 "'" 

Asp Asp Tyr Asn Ser He Met Val Lys Ala Leu Ala Asp Arg Leu Ala 
1090 1095 lioo 

Glu Ala Cys Ala Glu Trp Leu His Glu Arg Val Arg Lys Glu Tyr Trp 
1105 1110 ins U20 

Gly Tyr Ala Arg Asp Glu His Leu Asp Asn Glu Ala Leu He Lys Glu 
1125 H30 H35 

Gin Tyr Val Gly He Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Asp 
1140 H45 H50 

His Thr Glu Lys Gly Thr Leu Phe Glu Leu Leu Asp Pro Gin Gly Leu 
1155 H60 H65 

Ser Gly Val Ser Leu Thr Glu His Tyr Ala Met Phe Pro Ala Ala Ala 
1170 H7S 1180 

Val Ser Gly Trp Tyr Phe Ala His Pro Gin Ala Gin Tyr Phe Ala Val 
1185 1190 1195 • 1200 

Gly Lys lie Asp Lys Asp Gin Val Glu Arg Tyr Ser Gin Arg Lys Gly 
1205 1210 1215 

Gin Glu Ala Ser Val Ser Glu Arg Trp Leu Ala Pro Asn Leu Gly Tyr 
1220 1225 1230 

Asp Asp 



<210> 29 
<211> 3714 
<212> DNA 

<213> Nitrosomas europeae 

<220> 
<221>.CDS 
<222> (1) (3711) 
<223> RNE01732 

<400> 29 

atg aca atg cat gaa cgt get gat ttg ctg aaa egg ttg ctt gee gag 48 

Met Thr Met His Glu Arg Ala Asp Leu Leu Lys Arg Leu Leu Ala Glu 

15 io 15 



cgt ate ctg atg etc gac ggt gee atg ggt acg atg ate cag age tac 96 
Arg He Leu Met Leu Asp Gly Ala Met Gly Thr Met He Gin Ser Tyr 
20 25 30 



I 
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aaa ctg acc gag teg gat tat egg ggg gaa cgt ttt gee gat ttt ccg 144 
Lys Leu Thr Glu Ser Asp Tyr Arg Gly Glu Arg Phe Ala Asp Phe Pro 
35 40 45 

cat gat etc aaa ggc aac aat gat ctg etc tgc ctg acc aga ccg gaa 192 
His Asp Leu Lys Gly Asn Asn Asp Leu Leu Cys Leu Thr Arg Pro Glu 
50 55 60 

gtc ate cgc tec att cat cgt get tac etc gaa gec ggg teg gat ate 240 
Val lie Arg Ser lie His Arg Ala Tyr Leu Glu Ala Gly Ser Asp He 
65 70 75 80 

ate gag acc aac acg ttc aac teg aat gcg ccg teg atg gcg gac tac 288 
He Glu Thr Asn Thr Phe Asn Ser Asn Ala Pro Ser Met Ala Asp Tyr 
85 90 95 

cac atg cag gat ctg gtg tat gaa ctg aat gtg gcg ggt gcg cgc ctg 336 
His Met Gin Asp Leu Val Tyr Glu Leu Asn Val Ala Gly Ala Arg Leu . 
100 105 HO 

gcg tgt gag gaa gcg egg gca atg gaa acg cag caa cct gac egg ccc 384 
Ala Cys Glu Glu Ala Arg Ala Met Glu Thr Gin Gin Pro Asp Arg Pro 
115 120 125 

cgt ttc gtt gee ggt gtg ate ggg cct acc acc aaa acg get tea etc 432 
Arg Phe Val Ala Gly Val He Gly Pro Thr Thr Lys Thr Ala Ser Leu 
130 135 140 

tea ccg gat gtc aat gat cct gga ttc egg gec att acc ttc gat gat 480 
Ser Pro Asp Val Asn Asp Pro Gly Phe Arg Ala He Thr Phe Asp Asp 
145 150 155 160 

ctg gtg gaa age tat acc gag teg gtg cgc ggg ctg ate gac gga ggc 528 
Leu Val Glu Ser Tyr Thr Glu Ser Val Arg Gly Leu He Asp Gly Gly 
165 170 175 

gcg gat att ctg ctg gtc gaa acc att ttt gac acc ttg aat gee aaa 576 
Ala Asp He Leu Leu Val Glu Thr He Phe Asp Thr Leu Asn Ala Lys 
180 185 190 

gec gca ttg ttt gec ate gat cag tat ttc gaa acg cat gga tta cgt 624 
Ala Ala Leu Phe Ala He Asp Gin Tyr Phe Glu Thr His Gly Leu Arg 
195 200 205 

ctg ccg gtg atg ata teg gtc acg att acc gat get teg gga cgt aat 672 
Leu Pro Val Met He Ser Val Thr He Thr Asp Ala Ser Gly Arg Asn 
210 215 220 

ctt tec ggg cag aca ccg gaa get ttc tgg aat teg gta egg cat gca 720 
Leu Ser Gly Gin Thr Pro Glu Ala Phe Trp Asn Ser Val Arg His Ala 
225 230 235 240 

cgt ccg ctt teg gtg gga ate aac tgc gcg ttg ggt gcg gag ttg atg 768 
Arg Pro Leu Ser Val Gly He Asn Cys Ala Leu Gly Ala Glu Leu Met 
245 250 255 

cgc ccc tac gtg gaa gag ttg tec aat gtg get gag gtt ttc acc age 816 
Arg Pro Tyr Val Glu Glu Leu Ser Asn Val Ala Glu Val Phe Thr Ser 
260 265 270 

gec cat ccc aat gee ggc ttg cct aat ccc ttg gcg gaa acc ggt tat 864 
Ala His Pro Asn Ala Gly Leu Pro Asn Pro Leu Ala Glu Thr Gly Tyr 
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275 280 285 

gac gaa acg ccg gaa tat acc gcc cgt ctg ate aag gat ttt gcg caa 912 
Asp Glu Thr Pro Glu Tyr Thr Ala Arg Leu He Lye Asp Phe Ala Gin 
290 295 300 



tec ggg ttc gtc aac att gtc ggc ggc tgc tgt ggc act aca ccg aaa 
Ser Gly Phe Val Asn He Val Gly Gly Cya Cye Gly Thr Thr Pro Ly B 
305 310 315 320 

cat ate gcg gcc att gca gaa gcg gta egg gac ate cct cc& cgc cca 
Hie He Ala Ala He Ala Glu Ala Val Arg Asp He Pro Pro Arg Pro 
325 330 335 

ctg ccc gat att cct aaa aaa ctg agg ctt tec ggc etc gag ccg etc 
Leu Pro Asp He Pro Lys Lys Leu Arg Leu Ser Gly Leu Glu Pro Leu 
340 345 350 

aat ate gat gaa cat tec ctg ttc gta aac gtg ggt gaa cgt acc aat 
Asn He Asp Glu His Ser Leu Phe Val Asn Val Gly Glu Arg Thr Asn 
355 360 365 

gtc acc ggc tec aag gca ttt gcc egg ctg att etc aat ggc ggt tat 1152 
Val Thr Gly Ser Lys Ala Phe Ala Arg Leu He Leu Asn Gly Gly Tyr 
370 375 380 



get gaa ggg ctg gtg ate gcg cgc age cag gtg gag aac ggc &ca caa 
Ala Glu Gly Leu Val He Ala Arg Ser Gin Val Glu Asn Gly Ala Gin 
385 390 395 



400 



445 



ctg aaa tgt gtc cag ggt aag gcg gtc ate aat tec ate age etc aag 
Leu Lys Cys Val Gin Gly Lys Ala Val He Asn Ser He Ser Leu Lys 
450 455 460 



att gaa cag gcc gat ttc cca ccc gag gat ate att ttc gac ccc aat 
He Glu Gin Ala Asp Phe Pro Pro Glu Asp He He Phe Asp Pro Asn 
515 520 525 



960 



1006 



1056 



1104 



1200 



ate ate gat ate aac atg gat gaa gcg atg ctg gat tea cag aag gcg 
He He Asp He Asn Met Asp Glu Ala Met Leu Asp Ser Gin Lys Ala 
405 410 * 415 

atg gtg acc ttt ctg aat ctg etc get gcc gaa ccg gat ate age egg 
Met Val Thr Phe Leu Asn Leu Leu Ala Ala Glu Pro Asp He Ser Arq 
420 425 430 

ctg ccg ate atg etc gat tec age aaa tgg teg gtg ate gaa gcc gga 1344 
Leu Pro He Met Leu Asp Ser Ser Lys Trp Ser Val He Glu Ala Gly 
435 440 



1248 



1296 



1392 



gaa ggt gaa gcg gag ttt tta cat cat gcc agg ctg gcg cgt cgt tat 1440 
Glu Gly Glu Ala Glu Phe Leu His His Ala Arg Leu Ala Arg Arg Tvr 
465 470 475 480 

ggg gcc gcg gtg att gtc atg get ttc gac gaa acc ggg cag gcc gat 1488 
Gly Ala Ala Val He Val Met Ala Phe Asp Glu Thr Gly Gin Ala Asp 
485 490 495 

acc ttg cag cgc aag gtg gaa ate tgc acg cgt tgt tac cat aca ctg 1536 
Thr Leu Gin Arg Lys Val Glu He Cye Thr Arg Cys Tyr His Thr Leu 
5 °° 505 510 



1584 
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att ttt gcc att get acg ggt ate gaa gaa cac agt aac tat gca gtg 1632 
He Phe Ala He Ala Thr Gly He Glu Glu His Ser Asn Tyr Ala Val 
530 535 540 

gat ttt ate gag gcg aca cac gtc ate egg caa acg ctg cct tat gcc 1680 
Asp Phe He Glu Ala Thr His Val He Arg Gin Thr Leu Pro Tyr Ala 
5 « 550 555 560 

aaa gtc age ggg ggt gtt tec aat gtt tec ttc teg ttc egg ggt aac 1728 
Lys Val Ser Gly Gly Val Ser Asn Val Ser Phe Ser Phe Arg Gly Asn 
565 570 575 

gaa ccg ate cgc gaa gcc att cat ace gca ttc ctg tat cac gcg gtc 1776 
Glu Pro He Arg Glu Ala He His Thr Ala Phe Leu Tyr His Ala Val 
580 585 590 

aag gca ggc atg acc atg ggt ate gtc aac gca ggt cag ctt ggg gtt 1824 
Lys Ala Gly Met Thr Met Gly He Val Asn Ala Gly Gin Leu Gly Val 
595 600 605 

tat tec gac att ccg ccc gat ctg ctg gaa cat gtc gag gat gta ctg 1872 
Tyr Ser Asp He Pro Pro Asp Leu Leu Glu His Val Glu Asp Val Leu 
610 615 620 



ctg aac egg egg cct gat gca acc gaa cgt ctg gtg gag ttt gcg gaa 
Leu Asn Arg Arg Pro Asp Ala Thr Glu Arg Leu Val Glu Phe Ala Glu 
625 630 635 



1920 



640 



cat ttc aag gga cag aaa aag gag cag ate gaa gat ctg tec tgg cgt 1968 
His Phe Lys Gly Gin Lys Lys Glu Gin He Glu Asp Leu Ser Trp Arg 
645 650 655 

gat gaa ccg gtg egg cag cgc ctg att cat gca ctg gtc agg ggt ate 2016 
Asp Glu Pro Val Arg Gin Arg Leu He His Ala Leu Val Arg Gly He 
660 665 670 

age acc tac ate gtc gag gat acc gag etc gtc egg cag gag ate gac 2064 
Ser Thr Tyr He Val Glu Asp Thr Glu Leu Val Arg Gin Glu He Asp 
675 680 685 

age cag gga ggc aag ccg ate gag gtg ate gaa ggc ccg etc atg gac 2112 
Ser Gin Gly Gly Lys Pro He Glu Val He Glu Gly Pro Leu Met Asp 
690 695 700 

ggc atg aat gta gtg ggg gat ctg ttt ggc gca ggc aag atg ttt ctg 2160 
Gly Met Asn Val Val Gly Asp Leu Phe Gly Ala Gly Lys Met Phe Leu 
705 710 715 720 

cca cag gtg gtc aag teg gca egg gtg atg aag cag gcg gtt gcc tat 2208 
Pro Gin Val Val Lys Ser Ala Arg Val Met Lys Gin Ala Val Ala Tyr 
725 730 735 

ctg ttg ccg tac ate gag gca gag aaa aaa att tec ggc gac age aag 2256 
Leu Leu Pro Tyr He Glu Ala Glu Lys Lys He Ser Gly Asp Ser Lys 
740 745 750 

ccc aag ggc aag gtg gtg ate get acc gtc aaa ggg gat gtg cat gat 2304 
Pro Lys Gly Lys Val Val He Ala Thr Val Lys Gly Asp Val His Asp 
755 760 765 

att ggc aag aat ate gtt tec gtc gtg ttg cag tgt aat aac ttt gaa 2352 
He Gly Lys Asn He Val Ser Val Val Leu Gin Cys Asn Asn Phe Glu 



WO 03/087386 PCT/EP03/04010 

128 

770 775 780 

gtc ate aac atg ggg gtg atg gtc ccc agt gca cag att ctg gaa aca 2400 
Val He Asn Met Gly Val Met Val Pro Ser Ala Gin lie. Leu Glu Thr 
785' 790 795 800 

gca cgc cgt gaa cag gtc gat atg ate ggt ctg tec ggc ctg ate acc 2448 
Ala Arg Arg Glu Gin Val Asp Met He Gly Leu Ser Gly Leu He Thr 
805 810 815 

cct teg ctg gaa gaa atg gcg cat gtt gee egg gaa atg gag cgt gaa 2496 
Pro Ser Leu Glu Glu Met Ala His Val Ala Arg Glu Met Glu Arg Glu 
820 825 830 

caa ttc acc gtt ccg ctg ctg ate ggt ggc gee acc act teg egg atg 2544 
Gin Phe Thr Val Pro Leu Leu He Gly Gly Ala Thr Thr Ser Arg Met 
835 840 845 

cat acg gca gtc aaa ate gca ccc cat tac ggt ggg gtg acc gta tgg 2592 
Hie Thr Ala Val LyB He Ala Pro His Tyr Gly Gly Val Thr Val Trp 
850 855 860 

gtg ccg gat gee age egg gca gtc ggg gtg tgc age aat ctg atg tea 2640 
Val Pro Asp Ala Ser Arg Ala Val Gly Val Cys Ser Asn Leu Met Ser 
865 870 875 880 

cag gat ctg cgt gat gac tat gtc egg cag gtc aag gee gag dag gag 2688 
Gin Asp Leu Arg Asp Asp Tyr Val Arg Gin Val Lys Ala Glu Gin Glu 
885 890 895 

aag age egg gtg cag cac cgc aac aag aaa ggg cca tec aag etc etc 2736 
Lys Ser Arg Val Gin His Arg Asn Lys Lys Gly Pro Ser Lys Leu Leu 
900 905 910 

act ttc gag gaa gee egg gee aac gca etc aag acg gat tgg get cgt 2784 
Thr Phe Glu Glu Ala Arg Ala Asn Ala Leu Lys Thr Asp Trp Ala Arg 
915 920 925 

tat act cca cca get ccg gat ttc ctg ggg ttg cgc acc etc aac aac 2832 
Tyr Thr Pro Pro Ala Pro Asp Phe Leu Gly Leu Arg Thr Leu Asn Asn 
330 935 940 

tat ccg ctg gaa aca ctg gtg ccg cac ate gac tgg aca cct ttc ttc 2880 
Tyr Pro Leu Glu Thr Leu Val Pro His He Asp Trp Thr Pro Phe Phe 
94 5 950 955 960 

cag gca tgg gaa ctg cac ggg cgc tat cct gec ate ctg cag gat gaa 2928 
Gin Ala Trp Glu Leu His Gly Arg Tyr Pro Ala He Leu Gin Asp Glu 
965 970 975 

etc gtc ggg gaa gca gec age aat ctg ttt cgc gat gee cag aat atg 2976 
Leu Val Gly Glu Ala Ala Ser Asn Leu Phe Arg Asp Ala Gin Asn Met 
980 985 990 

etc aga aaa ate gtc gag caa aaa tgg etc acc gec aac gee gtt ate 3024 
Leu Arg Lys He Val Glu Gin Lys Trp Leu Thr Ala Asn Ala Val He 
995 1000 1005 

ggc ctg ttc ccg gee aat acc gtc aat gga gat gat ate gag att tat 3072 
Gly Leu Phe Pro Ala Asn Thr Val Asn Gly Asp Asp He Glu He Tyr 
1010 1015 1020 
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get gac cgt agt cgc agt cag gtg ate atg acc tgg cac acc ttg egg 3120 
Ala Asp Arg Ser Arg Ser Gin Val He Met Thr Trp Hie Thr Leu Arg 
102 5 1030 1035 1040 

cag cag acg gee aaa ccg gca ggg cgt ccc aat ctg gca ctg get gat 3168 
Gin Gin Thr Ala Lys Pro Ala Gly Arg Pro Asn Leu Ala Leu Ala Aap 
1045 1050 1055 

ttc att gcg ccg cgt gaa acc gga ctg gac gat acc ate ggt ttg ttt 3216 
Phe He Ala Pro Arg Glu Thr Gly Leu Asp Asp Thr He Gly Leu Phe 
1060 1065 1070 

gee gtc age gee ggt ttc ggt ate gat gaa cgc ata cgc get ttt gaa 3264 
Ala Val Ser Ala Gly Phe Gly He Asp Glu Arg He Arg Ala Phe Glu 
1075 1080 1085 

get gca aac gat gat tac agt gec ate ate ctg aaa gca ctg get gat 3312 
Ala Ala Asn Asp Asp Tyr. Ser Ala He He Leu Lys Ala Leu Ala Asp 
1090 1095 HOO 



cgt ctg get gaa gcg ttt gca gaa cac atg cat gca egg gtg egg cga 3360 
Arg Leu Ala Glu Ala Phe Ala Glu His Met His Ala Arg Val Arg Arg 
1105 mo ins * 1120 

gaa ttc tgg ggc tat gtg aaa gat gag agt ctg gac aat gaa cag ttg 3408 
Glu Phe Trp Gly Tyr Val Lys Asp Glu Ser Leu Asp Asn Glu Gin Leu 
1125 H30 H35 

ate gac gag caa tac ctg gga ate cgt cca gca cca ggt tat cct gec 3456 
He Asp Glu Gin Tyr Leu Gly He Arg Pro Ala Pro Gly Tyr Pro Ala 
1140 ii45 1150 

tgc cct gat cat acc gaa aag ggg cca ttg ttc get ctg ctg gaa gcg 3504 
Cys Pro Asp His Thr Glu Lys Gly Pro Leu Phe Ala Leu Leu Glu Ala 
1155 H60 1165 



gaa aaa cgc age gga ate gtc ata acg gaa tea ttt gec atg gtg ccg 
Glu Lys Arg Ser Gly He Val He Thr Glu Ser Phe Ala Met Val Pro 
1170 H75 H80 



3552 



act gca gca gta tec ggc ttc tat etc tct tac cct gaa tec age tat 3600 
Thr Ala Ala Val Ser Gly Phe Tyr Leu Ser Tyr Pro Glu Ser Ser Tyr 
1185 1190 U95 1200 



ttt get gtt gga aaa ate gga aaa gat cag gtc gag gat tat gca aga 
Phe Ala Val Gly Lys He Gly Lys Asp Gin Val Glu Asp Tyr Ala Arg 
1205 1210 1215 

cgc aaa ggg tgg acg ctg gaa gaa gca gaa agg tgg ctt gcg cct gtc 
Arg Lys Gly Trp Thr Leu Glu Glu Ala Glu Arg Trp Leu Ala Pro Val 
1220 1225 1230 

ttg gcg tat gag cgt taa 
Leu Ala Tyr Glu Arg 
1235 



3648 



3696 



3714 



<210> 30 
<211> 1237 
<212> PRT 

<213> Nitrosomas europeae 
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<400> 30 

Met Thr Met His Glu Arg Ala Asp Leu Leu Lys Arg Leu Leu Ala Glu 
15 10 15 

Arg. He Leu Met Leu Asp Gly Ala Met Gly Thr Met He Gin' Ser Tyr 
20 25 30 

Lys Leu Thr Glu Ser Asp Tyr Arg Gly Glu Arg Phe Ala Asp Phe Pro 
35 40 45 

His Asp Leu Lys Gly Asn Asn Asp Leu Leu Cys Leu Thr Arg Pro Glu 
50 55 go 

Val He Arg Ser He His Arg Ala Tyr Leu Glu Ala Gly Ser Asp He 
6S 70 75 80 

He Glu Thr Asn Thr Phe Asn Ser Asn Ala Pro Ser Met Ala Asp Tyr 
85 90 95 

His Met Gin Asp Leu Val Tyr Glu Leu Asn Val Ala Gly Ala Arg Leu 
100 105 no 

Ala Cys Glu Glu Ala Arg Ala Met Glu Thr Gin Gin Pro Asp Arg Pro 
115 120 125 

Arg Phe Val Ala Gly Val He Gly Pro Thr Thr Lys Thr Ala Ser Leu 
130 135 140 

Ser Pro Asp Val Asn Asp Pro Gly Phe Arg Ala He Thr Phe' Asp Asp 
145 150 155 160 

Leu Val Glu Ser Tyr Thr Glu Ser Val Arg Gly Leu He Asp Gly Gly 
165 170 175 

Ala Asp He Leu Leu Val Glu Thr He Phe Asp Thr Leu Asn Ala Lys 
180 185 190 

Ala Ala Leu Phe Ala He Asp Gin Tyr Phe Glu Thr His Gly Leu Arg 
155 200 205 

Leu Pro Val Met He Ser Val Thr He Thr Asp Ala Ser Gly Arg Asn 
210 215 220 

Leu Ser Gly Gin Thr Pro Glu Ala Phe Trp Asn Ser Val Arg His Ala 
225 230 235 240 

Arg Pro Leu Ser Val Gly He Asn Cys Ala Leu Gly Ala Glu Leu Met 
245 250 255 

Arg Pro Tyr Val Glu Glu Leu Ser Asn Val Ala Glu Val Phe Thr Ser 
260 265 270 

Ala His Pro Asn Ala Gly Leu Pro Asn Pro Leu Ala Glu Thr Gly Tyr 
275 280 285 

Asp Glu Thr Pro Glu Tyr Thr Ala Arg Leu He Lys Asp Phe Ala Gin 
250 295 300 

Ser Gly Phe Val Asn He Val Gly Gly Cys Cys Gly Thr Thr Pro Lys 
305 310 315 320 

His He Ala Ala He Ala Glu Ala Val Arg Asp He Pro Pro Arg Pro 
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325 330 335 

Leu Pro Asp He Pro Lys Lys Leu Arg Leu Ser Gly Leu Glu Pro Leu 
340 345 350 

Asn He Asp Glu His Ser Leu Phe Val Asn Val Gly Glu Arg Thr Asn 
355 360 365 

Val Thr Gly Ser Lys Ala Phe Ala Arg Leu He Leu Asn Gly Gly Tyr 
370 375 380 

Ala Glu Gly Leu Val He Ala Arg Ser Gin Val Glu Asn Gly Ala Gin 
385 390 395 400 

He He Asp He Asn Met Asp Glu Ala Met Leu Asp Ser Gin Lys Ala 
405 410 415 

Met Val Thr Phe Leu Asn Leu Leu Ala Ala Glu Pro Aap He Ser Arg 
420 425 430 

Leu Pro He Met Leu Asp Ser Ser Lys Trp Ser Val He Glu Ala Gly 
435 440 445 

Leu Lys Cys Val Gin Gly Lys Ala Val He Asn Ser lie Ser Leu Lys 
450 455 460 

Glu Gly Glu Ala Glu Phe Leu His His Ala Arg Leu Ala Arg Arg Tyr 
465 470 475 480 

Gly Ala Ala Val He Val Met Ala Phe Asp Glu Thr Gly Gin Ala Asp 
485 490 495 

Thr Leu Gin Arg Lys Val Glu He Cys Thr Arg Cys Tyr His Thr Leu 
500 505 510 

He Glu Gin Ala Asp Phe Pro Pro Glu Asp He He Phe Asp Pro Asn 
515 520 525 

He Phe Ala He Ala Thr Gly He Glu Glu His Ser Asn Tyr Ala Val 
530 535 540 

Asp Phe He Glu Ala Thr His Val He Arg Gin Thr Leu Pro Tyr Ala 
545 550 555 560 

Lys Val Ser Gly Gly Val Ser Asn Val Ser Phe Ser Phe Arg Gly Asn 
565 570 575 

Glu Pro He Arg Glu Ala He His Thr Ala Phe Leu Tyr His Ala Val 
580 585 590 

Lys Ala Gly Met Thr Met Gly He Val Asn Ala Gly Gin Leu Gly Val 
595 600 605 

Tyr Ser Asp He Pro Pro Asp Leu Leu Glu His Val Glu Asp Val Leu 
61 0 615 620 

Leu Asn Arg Arg Pro Asp Ala Thr Glu Arg Leu Val Glu Phe Ala Glu 
625 630 635 640 

His Phe Lys Gly Gin Lys Lys Glu Gin He Glu Asp Leu Ser Trp Arg 
645 650 655 
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Asp Glu Pro Val Arg Gin Arg Leu He His Ala Leu Val Arg Gly He 
660 665 670 

Ser Thr Tyr He Val Glu Asp Thr Glu Leu Val Arg Gin Glu lie Asp 
675 680 685 

Ser Gin Gly Gly Lys Pro He Glu Val He Glu Gly Pro Leu Met Asp 
690 695 700 

Gly Met Asn Val Val Gly Asp Leu Phe Gly Ala Gly Lys Met Phe Leu 
7 °5 710 715 720 

Pro Gin Val Val Lys Ser Ala Arg Val Met Lys Gin Ala Val Ala Tyr 
725 730 735 

Leu Leu Pro Tyr He Glu Ala Glu Lys Lys lie Ser Gly Asp Ser Lys 
740 745 750 

Pro Lys Gly Lys Val Val He Ala Thr Val Lys Gly Asp Val His Asp 
755 760 765 

He Gly Lys Asn He Val Ser Val Val Leu Gin Cys Asn Asn Phe Glu 
770 775 780 

Val He Asn Met Gly Val Met Val Pro Ser Ala Gin He Leu Glu Thr 
785 790 795 800 

Ala Arg Arg Glu Gin Val Asp Met He Gly Leu Ser Gly Leu He Thr 
805 810 815 

Pro Ser Leu Glu Glu Met Ala His Val Ala Arg Glu Met Glu Arg Glu 
820 825 830 

Gin Phe Thr Val Pro Leu Leu He Gly Gly Ala Thr Thr Ser Arg Met 
835 840 845 

His Thr Ala Val Lys He Ala Pro His Tyr Gly Gly Val Thr Val Trp 
850 855 860 

Val Pro Asp Ala Ser Arg Ala Val Gly Val Cys Ser Asn Leu Met Ser 
665 870 875 880 

Gin Asp Leu Arg Asp Asp Tyr Val Arg Gin Val Lys Ala Glu Gin Glu 
885 890 895 

Lys Ser Arg Val Gin His Arg Asn Lys Lys Gly Pro Ser Lys Leu Leu 
900 905 910 

Thr Phe Glu Glu Ala Arg Ala Asn Ala Leu Lys Thr Asp Trp Ala Arg 
915 920 925 

Tyr Thr Pro Pro Ala Pro Asp Phe Leu Gly Leu Arg Thr Leu Asn Asn 
930 935 940 

Tyr Pro Leu Glu Thr Leu Val Pro His He Asp Trp Thr Pro Phe Phe 
945 950 955 960 

Gin Ala Trp Glu Leu His Gly Arg Tyr Pro Ala He Leu Gin Asp Glu 
965 970 975 

Leu Val Gly Glu Ala Ala Ser Asn Leu Phe Arg Asp Ala Gin Asn Met 
980 985 ~ 990 
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Leu Arg Lys lie Val Glu Gin Lys Trp Leu Thr Ala Asn Ala Val He 
995 1000 1005 

Gly Leu Phe Pro Ala Asn Thr Val Asn Gly Asp Asp He Glu He Tyr 
1010 1015 1020 

Ala Asp Arg Ser Arg Ser Gin Val He Met Thr Trp His Thr Leu Arg 
1Q 25 1030 1035 1040 

Gin Gin Thr Ala Lys Pro Ala Gly Arg Pro Asn Leu Ala Leu Ala Asp 
1045 1050 1055 

Phe He Ala Pro Arg Glu Thr Gly Leu Asp Asp Thr He Gly Leu Phe 
1060 1065 1070 

Ala Val Ser Ala Gly Phe Gly He Asp Glu Arg He Arg Ala Phe Glu 
1075 1080 1085 

Ala Ala Asn Asp Asp Tyr Ser Ala He He Leu Lys Ala Leu Ala Asp 
1090 1095 1100 

Arg Leu Ala Glu Ala Phe Ala Glu His Met His Ala Arg Val Arg Arg 
H05 mo ins H20 

Glu Phe Trp Gly Tyr Val Lys Asp Glu Ser Leu Asp Asn Glu Gin Leu 
1125 H30 1135 

He Asp Glu Gin Tyr Leu Gly He Arg Pro Ala Pro Gly Tyr Pro Ala 
H40 1145 1150 

Cys Pro Asp His Thr Glu Lys Gly Pro Leu Phe Ala Leu Leu Glu Ala 
II 55 H60 H65 

Glu Lys Arg Ser Gly He Val He Thr Glu Ser Phe Ala Met Val Pro 
1170 1175 1180 

Thr Ala Ala Val Ser Gly Phe Tyr Leu Ser Tyr Pro Glu Ser Ser Tyr 
ll 85 H90 1195 1200 

Phe Ala Val Gly Lys He Gly Lys Asp Gin Val Glu Asp Tyr Ala Arg 
1205 1210 1215 

Arg Lys Gly Trp Thr Leu Glu Glu Ala Glu Arg Trp Leu Ala Pro Val 
1220 1225 1230 

Leu Ala Tyr Glu Arg 
1235 



<210> 31 
<211> 3774 
<212> DNA 

<213> Bordetella pertussis 

<220> 
<221> CDS 
<222> (1) . . (3771) 
<223> RBP00104 



<220> 

<22l> unsure 
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<222> 205 . . 205 

<223> All occurrences of n indicate any nucleotide 
<220> 

<221> unsure 
<222> 277 . . 277 



<223> All occurrences of n indicate any nucleotide 



<400> 31 

gtg cct tat ccc cgt ate ccc ttc ccg ctg tec gee tac acg cat ggc 48 
Val Pro Tyr Pro Arg He Pro Phe Pro Leu Ser Ala Tyr Thr His Gly 
1 5 10 15 

ggc gag ttc gtc cgc caa ctg gac aag cgc ate ctg ate ctg gat ggt 96 
Gly Glu Phe Val Arg Gin Leu Asp Lys Arg He Leu lie Leu Asp Gly 
20 25 30 

gee atg ggc acg atg ate cag cgc tac aag ctg ggc gag gee gat ttc 144 
Ala Met Gly Thr Met He Gin Arg Tyr Lys Leu Gly Glu Ala Asp Phe 
35 40 45 

cgt ggc gag cgc ttc gec gag cac cac aag gat etc aag ggc gac aac 192 
Arg Gly Glu Arg Phe Ala Glu His His Lys Asp Leu Lys Gly Asp Asn 
50 55 60 

gaa ctg ctg teg ntg gtg cgc ccg gac gtg ate gcg gaa ate cac egg 240 
Glu Leu Leu Ser Xaa Val Arg Pro Asp Val He Ala Glu He His Arg 
65 70 75 80 

cag tac etc gag gec ggc gee gac gtg ate gag ace nac ace ttc ggc 288 
Gin Tyr Leu Glu Ala Gly Ala Asp Val He Glu Thr Xaa Thr Phe Gly 
85 90 95 

gec acg teg ate gee cag ggc gat tac gac ctg ccg gag ctg gee tac 336 
Ala Thr Ser He Ala Gin Gly Asp Tyr Asp Leu Pro Glu Leu Ala Tyr 
!00 105 no 

gag atg aac ctg gag teg gec cgc ctg gcg cgc gee gec tgc gac gee 384 
Glu Met Asn Leu Glu Ser Ala Arg Leu Ala Arg Ala Ala Cys Asp Ala 
US 120 125 

tac age acg ccc gag cat ccg cgc ttc gtg gee ggg gcg ctg ggg ccg 432 
Tyr Ser Thr Pro Glu His Pro Arg Phe Val Ala Gly Ala Leu Gly Pro 
130 135 140 



cag ccc aag ace gcg tec ate teg ccc gac gtc aac gac ccg ggg gcg 
Gin Pro Lys Thr Ala Ser He Ser Pro Asp Val Asn Asp Pro Gly Ala 
145 150 155 



160 



480 



cgc aac gtc ace ttc gac gag ctg cgc gcg gec tat gtc gag cag etc 528 
Arg Asn Val Thr Phe Asp Glu Leu Arg Ala Ala Tyr Val Glu Gin Leu 
165 170 175 

aat ggc ctg etc gac ggc ggc ate gac ate gtc ctg ate gaa acc ate 576 
Asn Gly Leu Leu Asp Gly Gly He Asp He Val Leu He Glu Thr He 
180 i 8 5 190 

ttc gat acg etc aac gee aag gcg gee ate ttc gec gtc gag gaa gcg 624 
Phe Asp Thr Leu Asn Ala Lys Ala Ala He Phe Ala Val Glu Glu Ala 
135 200 205 

ttc gag gcg cgc ggc gtg cgc ctg ccg gtg atg att teg ggc acc gtg 672 
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Phe Glu Ala Arg Gly Val Arg Leu Pro Val Met lie Ser Gly Thr Val 
210 215 220 

acc gat gcg teg ggc cgc ate ctg tec ggc cag ace gtc gag gcg ttc 720 
Thr Asp Ala Ser Gly Arg He Leu Ser Gly Gin Thr Val Glu Ala Phe 
225 230 235 240 

tgg aac teg gtg cgc cat gcg egg ccg gtc acc ate ggc ctg aac tgc 768 
Trp Asn Ser Val Arg His Ala Arg Pro Val Thr He Gly Leu Asn Cys 
245 250 255 

gcg ctg ggc gcg gcg ctg atg cgt ccg tat gtg gee gag ctg tec aag 816 
Ala Leu Gly Ala Ala Leu Met Arg Pro Tyr Val Ala Glu Leu Ser Lys 
260 265 270 

ate tgc gac acc tat gtg tgc gtc tat ccc aac gee ggc ctg ccc aat 864 
He Cys Asp Thr Tyr Val Cys Val Tyr Pro Asn Ala Gly Leu Pro Asn 
275 280 285 

ccc atg gec gag acg ggc ttt gac gaa acg ccg gee gat acc teg gee 912 
Pro Met Ala Glu Thr Gly Phe Asp Glu Thr Pro Ala Asp Thr Ser Ala 
290 295 300 

ctg ctg gaa gag ttc gec cag gee ggg ctg gtc aac atg gee ggc ggc 960 
Leu Leu Glu Glu Phe Ala Gin Ala Gly Leu Val Asn Met Ala Gly Gly 
305 310 315 320 

tgt tgc ggc acc acg ccc gag cac ate cgc gee ate gec ggc aag gtg 1008 
Cys Cys Gly Thr Thr Pro Glu His He Arg Ala He Ala Gly Lys Val 
325 330 335 

gee gcg ctg acg ccg cgc gcg gtg ccc gag gtg ccg gtc aag acc cgc 1056 
Ala Ala Leu Thr Pro Arg Ala Val Pro Glu Val Pro Val Lys Thr Arg 
340 345 350 

ctg teg ggc ctg gag gcg etc aac ate gac gac gag act ctg ttc gtc 1104 
Leu Ser Gly Leu Glu Ala Leu Asn He Asp Asp Glu Thr Leu Phe Val 
355 360 365 

aac gtg ggc gag cgc acc aac gtg acg ggc age aag atg ttc gee cgc 1152 
Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser Lys Met Phe Ala Arg 
370 375 380 

ctg gtc cgc gag gag aaa tac gac gag gcg ctg gee gtg gcg cgc cag 1200 
Leu Val Arg Glu Glu Lys Tyr Asp Glu Ala Leu Ala Val Ala Arg Gin 
385 390 395 400 

cag gtc gag aac ggg gee cag ate ate gac gtc aac atg gac gag gcg 1248 
Gin Val Glu Asn Gly Ala Gin He He Asp Val Asn Met Asp Glu Ala 
405 410 415 



atg ctg gac teg gtg gec tgt atg cac cgc ttc etc aac ctg ate gcg 
Met Leu Asp Ser Val Ala Cys Met His Arg Phe Leu Asn Leu He Ala 
420 425 430 



1296 



tec gag ccc gac ate gcg egg gtg ccg gtg atg ate gac agt tec aag 1344 
Ser Glu Pro Asp He Ala Arg Val Pro Val Met He Asp Ser Ser Lys 
435 440 445 

tgg gaa gtg ate gag acc ggc ctg aag tgc gtg cag ggc aag gee gtg 1392 
Trp Glu Val He Glu Thr Gly Leu Lys Cys Val Gin Gly Lys Ala Val 
450 455 460 
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gtc aac teg ate tec atg aag gaa ggc gag gag ccg ttc cgc cat cat 1440 

Val Asa Ser lie Ser Met LyB Glu Gly Glu Glu Pro Phe Arg His His 
4 *5 470 475 480 

gcg cgc ctg tgc cgc cgc tac ggc gcg gec atg gtg gtc atg gec ttc 1488 
Ala Arg Leu Cys Arg Arg Tyr Gly Ala Ala Met Val Val Met Ala Phe 
48S 490 495 

gac gaa cag ggg cag gec gac teg ctg. gag cgc cgc aag gaa ate tgc 1536 
Asp Glu Gin Gly Gin Ala Asp Ser Leu Glu Arg Arg Lys Glu lie Cys 
500 505 " 510 

ggc cgc gee tac cgt ate ctg gtc gag gaa gag ggc ttc ccg ccc gag 1584 
Gly Arg Ala Tyr Arg He Leu Val Glu Glu Glu Gly Phe Pro Pro Glu 
515 520 525 

gac ate ate ttc gat ccc aac gtg ttc gcg gtg gee ace ggc ate gac 1632 
Asp lie He Phe Asp Pro Asn Val Phe Ala Val Ala Thr Gly He Asp 
530 535 540 

gaa cac aat cac tac gee gtc gat ttc ate gaa ggc gcg cgc tgg ate 1680 
Glu His Asn His Tyr Ala Val Asp Phe He Glu Gly Ala Arg Trp He 
545 550 555 " 560 

cgc gcg aac ctg ccg cat gee cgc att teg ggc ggc ate teg aac gtc 1728 
Arg Ala Asn Leu Pro His Ala Arg He Ser Gly Gly He Ser Asn Val 
565 570 , 575 

age ttc teg ttc cgc ggc aac gag ccg atg cgc gag gcg ate cat ace 1776 
Ser Phe Ser Phe Arg Gly Asn Glu Pro Met Arg Glu Ala He His Thr 
580 585 590 

gtc ttc ctg tac tac gee ate gag gee ggc ctg acg atg ggc ate gtc 1824 
Val Phe Leu Tyr Tyr Ala He Glu Ala Gly Leu Thr Met Gly He Val 
595 600 605 

aac gcg ggc cag ctg ggc gta tat gee gac ctg gcg ccg cac ctg cgc 1872 
Asn Ala Gly Gin Leu Gly Val Tyr Ala Asp Leu Ala Pro His Leu Arg 
610 615 620 

gac ctg gtc gag gac gtc ate ctg gac cgc ccc gag ccg gtg ggc cgc 1920 
Asp Leu Val Glu Asp Val He Leu Asp Arg Pro Glu Pro Val Gly Arg 
625 630 635 640 

age gac teg gec gac gag cgc teg ccc ace gaa egg ctg gtg cag ttt 1968 
Ser Asp Ser Ala Asp Glu Arg Ser Pro Thr Glu Arg Leu Val Gin Phe 
645 650 " 655 

gec gag acc gtc aag ggc teg ggc gcg aag aag gaa gaa gac ctg ace 2016 
Ala Glu Thr Val LyB Gly Ser Gly Ala Lys Lys Glu Glu Asp Leu Thr 
660 665 670 

tgg cgc acc ggc teg gtc gag cag cgc ctg gcg cat gee ctg gtg cac 2064 
Trp Arg Thr Gly Ser Val Glu Gin Arg Leu Ala His Ala Leu Val His 
675 680 685 

ggc ate acc acc ttc ate gtc gag gac acc gag gaa gtg cgc cag cag 2112 
Gly He Thr Thr Phe He Val Glu Asp Thr Glu Glu Val Arg Gin Gin 
690 695 700 

gtc gec gcg cgc ggc ggg cgc acc ate gaa gtg ate gaa ggt ccg ctg 2160 
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Val Ala Ala Arg Gly Oly Arg Thr He Glu Val He Glu Gly Pro Leu 
705 710 715 720 

atg gac ggc atg aac gtg gtc ggc gac ctg ttc ggc gcg ggc aag atg 2208 
Met Asp Gly Met Asn Val Val Gly Asp Leu Phe Gly Ala Gly Lye Met 
725 730 735 

ttc ctg ccg caa gtg gtg aag teg gcg cgc gtg atg aag cag gcg gtg 2256 
Phe Leu Pro Gin Val Val Lys Ser Ala Arg Val Met Lys Gin Ala Val 
740 745 750 



gcg cac ctg att ccc ttc ate gag gag gaa aag cgc cag ate gcg gec 
Ala His Leu He Pro Phe He Glu Glu Glu Lys Arg Gin He Ala Ala 
755 760 765 



cgc ccg cag ate gac tgg tec ggc tac cag ccg ccg cgc ccc aag ttc 
Arg Pro Gin He Asp Trp Ser Gly Tyr Gin Pro Pro Arg Pro Lys Phe 
945 550 955 * 96Q 



2304 



gcg ggc ggc gat gtg cgc gee aag ggc aag ate gtg ate gee ace gtc 2352 
Ala Gly Gly Asp Val Arg Ala Lys Gly Lys He Val He Ala Thr Val 
770 775 780 

aag ggc gac gtg cac gac ate ggc aag aac ate gtg teg gtg gtc ttg 2400 
Lys Gly Asp Val His Asp He Gly Lys Asn He Val Ser Val Val Leu 
785 790 795 800 

cag tgc aat aac ttc gaa gtc gtg aac atg ggc gtg atg gtg ccg tgc 2448 
Gin Cys Asn Asn Phe Glu Val Val Asn Met Gly Val Met Val Pro Cys 
80S 810 815 

gec cag ate ctg cag aag gee aag gac gag aac gee gac atg ate ggc 2496 
Ala Gin He Leu Gin Lys Ala Lys ABp Glu Asn Ala Asp Met He Gly 
B20 825 830 

ctg tec ggc ctg ate acg ccc age etc gaa gag atg gec tac gtg get 2544 
Leu Ser Gly Leu He Thr Pro Ser Leu Glu Glu Met Ala Tyr Val Ala 
835 840 845 

tea gaa atg cag cgc gac ccc tat ttc cgc gag cgc gee atg ccg ctg 2592 
Ser Glu Met Gin Arg Asp Pro Tyr Phe Arg Glu Arg Ala Met Pro Leu 
850 855 860 

atg ata ggc ggg gcg acc ace age egg gtc cat acg gcg gtc aag ate 2640 
Met He Gly Gly Ala Thr Thr Ser Arg Val His Thr Ala Val Lys He 
865 8 70 875 880 

gcg ccc aac tac gac ggt ccg gtg ate tac gtg ccc gat gee age cgt 2688 
Ala Pro Asn Tyr Asp Gly Pro Val lie Tyr Val Pro Asp Ala Ser Arg 
88 5 890 895 

teg gtc ggc gtg gcg acc age etc atg tec gac cag gee ccg gee tat 2736 
Ser Val Gly Val Ala Thr Ser Leu Met Ser Asp Gin Ala Pro Ala Tyr 
9 °0 905 910 

ttg gcg gag ctg gcg cag gag tac gag gat gtg cgc cgc tgc cat gee 2784 
Leu Ala Glu Leu Ala Gin Glu Tyr Glu Asp Val Arg Arg Cys His Ala 
315 920 925 

aac cgc aag gcg gtg ccg ctg gtg teg ctg gee gag gcg cgc gcg gcg 2832 
Asn Arg Lys Ala Val Pro Leu Val Ser Leu Ala Glu Ala Arg Ala Ala 
930 935 940 



2880 
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ctg ggc egg cgc gec ttc aag age tac gac ctg gee gag ate gcg cgc 2928 
Leu Gly Arg Arg Ala Phe Lys Ser Tyr Asp Leu Ala Glu He Ala Arg 
965 970 ,975 

tat ate gac tgg ggg ccg ttc ttc cag acg tgg age ctg ttc ggc ccg 2976 
Tyr He Asp Trp Gly Pro Phe Phe Gin Thr Trp Ser Leu Phe Gly Pro 
980 985 990 

ttc ccc gec ate ctg gac gac aag gtg .gtg ggc gag cag gcg cgc aag 3024 
Phe Pro Ala He Leu Aep Asp Lys Val Val Gly Glu Gin Ala Arg Lys 
995 1000 1005 

gtc tac gag gaa ggc cag gec atg etc aag cgc ate ate gac ggg cgc 3072 
Val Tyr Glu Glu Gly Gin Ala Met Leu Lys Arg He He Asp Gly Arg 
1010 1015 1020 

tgg ctg acc gec age ggc gtg gtc ggc ttc tat ccg gee aac cgc gtc 3120 
Trp Leu Thr Ala Ser Gly Val Val Gly Phe Tyr Pro Ala Asn Arg Val 
102 5 1030 1035 1040 

aat gac gaa gac ate gag gtc tac gcg gac gag acg cgc age gag atg 3168 
Asn Asp Glu Asp He Glu Val Tyr Ala Asp Glu Thr Arg Ser Glu Met 
1045 1050 ~ 1055 

ctg ttc acc tac cgc aac ctg cgc cag cag ggc gtc aag cgc gaa ggc 3216 
Leu Phe Thr Tyr Arg Asn Leu Arg Gin Gin Gly Val Lys Arg Glu Gly 
1060 1065 1070, 

gtc age aac aag tgc ctg gee gac tac ate gcg ccg cgc gac age ggc 3264 
Val Ser Asn Lys Cys Leu Ala Asp Tyr He Ala Pro Arg Asp Ser Gly 
1075 1080 1085 

ctg etc gac tac ate ggc atg ttc gee gtg acc gcg ggc ctg ggc ate 3312 
Leu Leu ABp Tyr He Gly Met Phe Ala Val Thr Ala Gly Leu Gly He 
1090 1095 lioo 

gag aag aaa gag gee gag ttc cag gcg gcg ctg gac gac tac tec age 3360 
Glu Lys Lys Glu Ala Glu Phe Gin Ala Ala Leu Asp Asp Tyr Ser Ser 
1105 IHO 1115 * 1120 

ate atg ctg aag teg ctg gec gac egg ctg gee gag gcg ttc gee gaa 3408 
He Met Leu Lys Ser Leu Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu 
H25 H30 H35 

tgc atg cac gcg cgc gtg cgc cgc gac ctg tgg ggc tac gcg gcg gac 3456 
Cys Met His Ala Arg Val Arg Arg Asp Leu Trp Gly Tyr Ala Ala Asp 
1140 1145 H50 

gag gcg ctg tec aac gat gag ctg ate gec gag aag tac age ggc ate 3504 
Glu Ala Leu Ser Asn Asp Glu Leu He Ala Glu Lys Tyr Ser Gly He 
1155 H60 H65 



egg ccg gcg ccc ggc tat ccg gee tgc ccg gag cac gtg gtc aag acg 
Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Glu His Val Val Lys Thr 
H70 H75 H80 



3552 



gac ctg ttc cgc gtg ctg gac gee gee gac gtc gga atg gag ctg acc 3600 

Asp Leu Phe Arg Val Leu Asp Ala Ala Asp Val Gly Met Glu Leu Thr 
1185 H90 H95 1200 

gac age tac gec atg ttc ccg gec tec age gtc teg ggg ttc tat ttc 3648 
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Asp ser Tyr Ala Met Phe Pro Ala Ser Ser Val Ser Gly Phe Tyr Phe 
1205 1210 1215 

age cac ccc gag teg cag tat ttc aac gtg ggc aac ate ggc gec gac 3696 
Ser His Pro Glu Ser Gin Tyr Phe Aan Val Gly Aen lie Gly Ala Asp 
1220 1225 1230 

cag ctg gee gac tac gtg gcg cgc age ggc cgc gee gaa gag gac gtg 3744 
Gin Leu Ala Asp Tyr Val Ala Arg Ser Gly Arg Ala Glu Glu Asp Val 
1235 1240 1245 

cgc cgc acc ctg gcg ccg aac ctg ggc tag 3774 
Arg Arg Thr Leu Ala Pro Asn Leu Gly 
1250 1255 



<210> 32 
<211> 1257 
<212> PRT 

<213> Bordetella pertussis 
<220> 

<22l> unsure 
<222> 69 . . 69 

<223> All occurrences of Xaa indicate any amino acid 
<220> 

<221> unsure 
<222> 93 . . 93 

<223> All occurrences of Xaa indicate any amino acid 
<400> 32 

Val Pro Tyr Pro Arg He Pro Phe Pro Leu Ser Ala Tyr Thr His Gly 
15 10 15 

Gly Glu Phe Val Arg Gin Leu Asp Lys Arg He Leu He Leu Asp Gly 
20 25 30 

Ala Met Gly Thr Met He Gin Arg Tyr Lys Leu Gly Glu Ala Asp Phe 
35 40 45 

Arg Gly Glu Arg Phe Ala Glu His His Lys Asp Leu Lys Gly Asp Asn 
50 55 60 

Glu Leu Leu Ser Xaa Val Arg Pro Asp Val He Ala Glu He His Arg 
65 70 75 80 

Gin Tyr Leu Glu Ala Gly Ala Asp Val He Glu Thr Xaa Thr Phe Gly 
85 90 95 

Ala Thr Ser He Ala Gin Gly Asp Tyr Asp Leu Pro Glu Leu Ala Tyr 
100 105 no 

Glu Met Asn Leu Glu Ser Ala Arg Leu Ala Arg Ala Ala Cys Asp Ala 
115 120 125 

Tyr Ser Thr Pro Glu His Pro Arg Phe Val Ala Gly Ala Leu Gly Pro 
130 135 140 

Gin Pro Lys Thr Ala Ser He Ser Pro Asp Val Asn Asp Pro Gly Ala 
145 150 155 i 6 o 
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Arg Asn Val Thr Phe Asp Glu Leu Arg Ala Ala Tyr Val Glu Gin Leu 
165 170 175 

Abu Gly Leu Leu Asp Gly Gly He Asp He Val Leu He Glu Thr He 
180 105 190 ^ 

Phe Asp Thr Leu Asn Ala Lys Ala Ala He Phe Ala Val Glu Glu Ala 
195 200 205 

Phe Glu Ala Arg Gly Val Arg Leu Pro Val Met He Ser Gly Thr Val 
210 215 220 ' 

Thr Asp Ala Ser Gly Arg He Leu Ser Gly Gin Thr Val Glu Ala Phe 
225 2 30 235 240 

Trp Asn Ser Val Arg His Ala Arg Pro Val Thr lie Gly Leu Asn Cys 
24 5 250 255 

Ala Leu Gly Ala Ala Leu Met Arg Pro Tyr Val Ala Glu Leu Ser Lys 
2fi 0 265 270 

He Cys Asp Thr Tyr Val Cys Val Tyr Pro Asn Ala Gly Leu Pro Asn 
2? 5 260 285 

Pro Met Ala Glu Thr Gly Phe Asp Glu Thr Pro Ala Asp Thr Ser Ala 
2 *0 295 300 

Leu Leu Glu Glu Phe Ala Gin Ala Gly Leu Val Asn Met Ala Gly Gly 
305 310 315 320 

Cys Cys Gly Thr Thr Pro Glu His He Arg Ala He Ala Gly Lys Val 
325 330 335 

Ala Ala Leu Thr Pro Arg Ala Val Pro Glu Val Pro Val Lys Thr Arg 
340 345 350 

Leu Ser Gly Leu Glu Ala Leu Asn He Asp Asp Glu Thr Leu Phe Val 
355 360 365 

Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser Lys Met Phe Ala Ara 
370 375 380 

Leu Val Arg Glu Glu Lys Tyr Asp Glu Ala Leu Ala Val Ala Arg Gin 
385 390 395 400 

Gin Val Glu Asn Gly Ala Gin He He Asp Val Asn Met Asp Glu Ala 
405 410 415 

Met Leu Asp Ser Val Ala Cys Met His Arg Phe Leu Asn Leu He Ala 
420 425 430 

Ser Glu Pro Asp He Ala Arg Val Pro Val Met He Asp Ser Ser Lys 
4 35 440 445 

Trp Glu Val He Glu Thr Gly Leu Lys Cys Val Gin Gly Lye Ala Val 
45 ° 455 " 460 

Val Asn Ser He Ser Met Lys Glu Gly Glu Glu Pro Phe Arg His His 
465 4 ™ 475 480 

Ala Arg Leu Cys Arg Arg Tyr Gly Ala Ala Met Val Val Met Ala Phe 
4 «5 490 495 



I 
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Asp Glu Gin Gly Gin Ala Asp Ser Leu Glu Arg Arg Lye Glu He Cys 
500 505 510 

Gly Arg Ala Tyr Arg He Leu Val Glu Glu Glu Gly Phe Pro Pro Glu 
515 520 525 

Asp He lie Phe Asp Pro Asn Val Phe Ala Val Ala Thr Gly He Asp 
530 535 540 

Glu His ABn His Tyr Ala Val Asp Phe He Glu Gly Ala Arg Trp He 
545 550 555 560 

Arg Ala Asn Leu Pro His Ala Arg lie Ser Gly Gly lie Ser Asn Val 
565 570 575 

Ser Phe Ser Phe Arg Gly Asn Glu Pro Met Arg Glu Ala lie His Thr 
580 565 590 

Val Phe Leu Tyr Tyr Ala lie Glu Ala Gly Leu Thr Met Gly He Val 
595 600 605 

Asn Ala Gly Gin Leu Gly Val Tyr Ala Asp Leu Ala Pro His Leu Arg 
610 615 620 

Asp Leu Val Glu Asp Val lie Leu Asp Arg Pro Glu Pro Val Gly Arg 
"5 630 635 640 

Ser Asp Ser Ala Asp Glu Arg Ser Pro Thr Glu Arg Leu Val Gin Phe 
645 650 655 

Ala Glu Thr Val Lys Gly Ser Gly Ala LyB Lys Glu Glu Asp Leu Thr 
660 665 670 

Trp Arg Thr Gly Ser Val Glu Gin Arg Leu Ala His Ala Leu Val His 
675 680 685 

Gly He Thr Thr Phe lie Val Glu Asp Thr Glu Glu Val Arg Gin Gin 
690 695 700 

Val Ala Ala Arg Gly Gly Arg Thr lie Glu Val lie Glu Gly Pro Leu 
7 °S 710 715 720 

Met Asp Gly Met Asn Val Val Gly Asp Leu Phe Gly Ala Gly Lys Met 
725 730 735 

Phe Leu Pro Gin Val Val Lys Ser Ala Arg Val Met Lys Gin Ala Val 
740 745 750 

Ala His Leu lie Pro Phe He Glu Glu Glu Lys Arg Gin lie Ala Ala 
755 760 765 

Ala Gly Gly Asp Val Arg Ala Lys Gly Lys lie Val lie Ala Thr Val 
770 775 780 

Lys Gly Asp Val His Asp He Gly Lys Asn lie Val Ser Val Val Leu 
7 *5 790 795 800 

Gin Cys Asn Asn Phe Glu Val Val Asn Met Gly Val Met Val Pro Cys 
805 810 * 815 

Ala Gin lie Leu Gin Lys Ala Lys Asp Glu Asn Ala Asp Met lie Gly 
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820 825 830 

Leu Ser Gly Leu He Thr Pro Ser Leu Glu Glu Met Ala Tyr Val Ala 
835 840 845 

Ser Glu Met Gin Arg Asp Pro Tyr Phe Arg Glu Arg Ala Met Pro Leu 
850 855 860 

Met He Gly Gly Ala Thr Thr Ser Arg Val His Thr Ala Val Lys lie 
865 870 875 880 

Ala Pro Asn Tyr Asp Gly Pro Val He Tyr Val Pro Asp Ala Ser Arg 
885 890 895 

Ser Val Gly Val Ala Thr Ser Leu Met Ser Asp Gin Ala Pro Ala Tyr 
900 905 910 

Leu Ala Glu Leu Ala Gin Glu Tyr Glu Asp Val Arg Arg Cys His Ala 
915 920 925 

Asn Arg Lys Ala Val Pro Leu Val Ser Leu Ala Glu Ala Arg Ala Ala 
930 935 940 

Arg Pro Gin He Asp Trp Ser Gly Tyr Gin Pro Pro Arg Pro Lys Phe 
945 950 955 960 

Leu Gly Arg Arg Ala Phe Lys Ser Tyr Asp Leu Ala Glu He Ala Arg 
965 970 975 

Tyr He Asp Trp Gly Pro Phe Phe Gin Thr Trp Ser Leu Phe Gly Pro 
980 985 990 

Phe Pro Ala He Leu Asp Asp Lys Val Val Gly Glu Gin Ala Arg Lys 
995 1000 1005 

Val Tyr Glu Glu Gly Gin Ala Met Leu Lys Arg He He Asp Gly Arg 
1010 1015 1020 

Trp Leu Thr Ala Ser Gly Val Val Gly Phe Tyr Pro Ala Asn Arg Val 
1025 1030 1035 1040 

Asn Asp Glu Asp He Glu Val Tyr Ala Asp Glu Thr Arg Ser Glu Met 
1045 1050 1055 

Leu Phe Thr Tyr Arg Asn Leu Arg Gin Gin Gly Val Lys Arg Glu Gly 
1060 1065 1070 

Val Ser Asn Lys Cys Leu Ala Asp Tyr He Ala Pro Arg Asp Ser Gly 
1075 1080 1085 

Leu Leu Asp Tyr He Gly Met Phe Ala Val Thr Ala Gly Leu Gly He 
1090 1095 lioo 

Glu Lys Lys Glu Ala Glu Phe Gin Ala Ala Leu Asp Asp Tyr Ser Ser 
1105 mo ins 112Q 

He Met Leu Lys Ser Leu Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu 
1125 H30 H35 

Cys Met His Ala Arg Val Arg Arg Asp Leu Trp Gly Tyr Ala Ala Asp 
11*0 H45 H50 
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Glu Ala Leu Ser Asn Asp Glu Leu He Ala Olu Lys Tyr Ser Gly He 
1155 H60 ii65 

Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Glu His Val Val Lys Thr 
H ? 0 1175 HBO 

Asp Leu Phe Arg Val Leu Asp Ala Ala Asp Val Gly Met Glu Leu Thr 
1185 H90 U95 1200 

Asp Ser Tyr Ala Met Phe Pro Ala Ser Ser Val Ser Gly Phe Tyr Phe 
1205 1210 1215 

Ser His Pro Glu Ser Gin Tyr Phe Asn Val Gly Asn He Gly Ala Asp 
1220 1225 1230 

Gin Leu Ala Asp Tyr Val Ala Arg Ser Gly Arg Ala Glu Glu Asp Val 
1235 1240 1245 

Arg Arg Thr Leu Ala Pro Asn Leu Gly 
1250 1255 

<210> 33 
<211> 3645 
<212> DNA 

<213> Chlorobium tepidum 

<220> 
<221> CDS 
<222> (1) . . (3642) 
<223> RCL00420 

<400> 33 

gtg etc gac ggg gec atg ggc acc atg ate cag agg cat ggc etc gac 48 
Val Leu Asp Gly Ala Met Gly Thr Met He Gin Arg His Gly Leu Asp 
1 5 io i 5 

gaa cag gac tac egg ggc gag cgt ttc get teg cat gac cat ccg ctg 96 
Glu Gin Asp Tyr Arg Gly Glu Arg Phe Ala Ser His Asp His Pro Leu 
20 25 30 

aag ggc aac aac gac ctt ctt gtc ate acc egg ccc gac ate ate cgt 144 
Lys Gly Asn Asn Asp Leu Leu Val He Thr Arg Pro Asp He He Arg 
35 40 45 

teg ate cac tgc gac ttc etc gac gcg ggt gcg gac ate ate gag acc 192 
Ser He His Cys Asp Phe Leu Asp Ala Gly Ala Asp He He Glu Thr 
50 55 go 

tgc acc ttc aac gec aac ccg ate teg cag teg gac tac cag ttg cag 240 
Cys Thr Phe Asn Ala Asn Pro He Ser Gin Ser Asp Tyr Gin Leu Gin 
65 70 75 80 

gac ttg acc cgc gag ctg aac gtg gcg gcg gca aag ata gec cgc teg 288 
Asp Leu Thr Arg Glu Leu Asn Val Ala Ala Ala Lys He Ala Arg Ser 
85 90 95 

gca gcg gac gag ttc acc gca aag act ccc gac aag ccg cgt ttc gtg 336 
Ala Ala Asp Glu Phe Thr Ala Lys Thr Pro Asp Lys Pro Arg Phe Val 
100 105 no 



gec ggt tec ate gga ccg acc aac aag acg etc teg etc teg ccg gac 



384 
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Ala Gly Ser He Gly Pro Thr Aon Lye Thr Leu Ser Leu Ser Pro Asp 
115 120 125 

gtg aac aac ccc ggc ttc cgc gcc gtc acc ttc cag gag atg gtc gat 432 
Val Asn Asn Pro Gly Phe Arg Ala Val Thr Phe Gin Glu Met Val Asp 
130 135 140 

aac tac act gcc cag etc gaa ggc ttg cac gag ggc ggt gtc gat etc 480 
Asn Tyr Thr Ala Gin Leu Glu Gly Leu His Glu Gly Gly Val Asp Leu 
145 150 155 i 6 o 

ttg etc gtc gag acg gtg ttc gac aca ctg aac tgc aag gcg gcg etc 528 
Leu Leu Val Glu Thr Val Phe Asp Thr Leu Asn Cys Lys Ala Ala Leu 
165 170 175 

tac get ate gag gag tac gcg gtg aaa acc ggc tgg cag gtg ccc gtg 576 
Tyr Ala He Glu Glu Tyr Ala Val Lys Thr Gly Trp Gin Val Pro Val 
180 185 iso 

atg gtc tec ggc acg gtg gtg gac gcg age ggc cgc acc etc tec ggc 624 
Met Val Ser Gly Thr Val Val Asp Ala Ser Gly Arg Thr Leu Ser Gly 
195 200 205 

caa acc acc gag gcg ttc tgg att teg att teg cac atg ccg agt ctg 672 
Gin Thr Thr Glu Ala Phe Trp He Ser He Ser His Met Pro Ser Lei 
210 215 220 

etc teg gtc ggc ctg aac tgc gca etc ggc tec aag cag atg cgc ccc 720 
Leu Ser Val Gly Leu Asn Cys Ala Leu Gly Ser Lys Gin Met Arg Pro 
225 230 235 240 

ttc ate gag gcg etc teg aac ate gcc gaa age tac gtc age gtc tat 768 
Phe He Glu Ala Leu Ser Asn He Ala Glu Ser Tyr Val Ser Val Tyr 
245 250 255 

ccc aac gcg ggc ctg ccg aat gag ttc ggc gag tac gac gac tec ccc 816 
Pro Asn Ala Gly Leu Pro Asn Glu Phe Gly Glu Tyr Asp Asp Ser Pro 
260 265 270 

gag tac atg gcc gcg cag ate gcg ggc ttc gcc gaa tea ggc ttc gtg 864 
Glu Tyr Met Ala Ala Gin He Ala Gly Phe Ala Glu Ser Gly Phe Val 
275 280 285 

aac ate gtc ggc ggc tgc tgc ggc acc acg ccg acg cac ate cgc gcc 912 
Asn He Val Gly Gly Cys Cys Gly Thr Thr Pro Thr His He Arg Ala 
290 295 300 

att gcc gaa gcg gtc aag act etc ccg ccg aga aag cgc ccc gcc aac 960 
He Ala Glu Ala Val Lys Thr Leu Pro Pro Arg Lys Arg Pro Ala Asn 
305 310 315 320 

aag cac gtg ctg agg etc tec ggc etc gaa ccg etc gtg gtt gac gaa 1008 
Lys His Val Leu Arg Leu Ser Gly Leu Glu Pro Leu Val Val Asp Glu 
325 330 335 

acc acc ggc ttc ate aac gtc ggc gag cgc acc aac gtc acc ggt teg 1056 
Thr Thr Gly Phe He Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser 
340 345 350 

cgc aag ttc gcc cgc etc ate aag gag gcc aat tac gac gaa gcg etc 1104 
Arg Lys Phe Ala Arg Leu He Lys Glu Ala Asn Tyr Asp Glu Ala Leu 
355 360 365 
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tec att gee cgc cag cag gtc gag aac ggc gcg cag gtg ate gac gtg 1152 
Ser lie Ala Arg Gin Gin Val Glu Asn Gly Ala Gin Val lie Asp Val 
370 375 380 

aac etc gac gaa gga atg etc gac tec gaa aag gtg ate gtc gaa ttc 1200 
Aan Leu Asp Glu Gly Met Leu Aap Ser Glu Lys ?al lie 111 Glu III ° 
385 3 *° 395 400 

ctg aac etc ate gee tec gag cct gag ate gec aag gtg ccg gtg atg 1248 
Leu Aan Leu lie Ala Ser Glu Pro Glu lie Ala Lys Val Pro Val Met 
405 410 415 

ate gac teg teg aaa tgg teg gtc ate gaa aac ggc ctg cgc tgc acc 1296 
He Asp Ser Ser Lys Trp Ser Val lie Glu Asn Gly Leu Arg Cys Thr 
420 425 430 

cag ggc aag age ate gtc aac teg ate age etc aag gag ggc gag gag 1344 
Gin Gly Lys Ser lie Val Aan Ser lie Ser Leu Lys llu Gly 111 111 
435 440 445 

ctg ttc aag gag cgc get cgc aag ate atg caa tac ggc gcg gcg gcg 1392 
Leu Phe Lys Glu Arg Ala Arg Lys lie Met Gin Tyr §Iy III 111 12 
450 455 460 

Va? Va? m2 111 dk° I" 9 f 9 C f 9 " C C39 9CC 93C a 9 C Ct 9 CaC C 9 C 14 « 
Val Val Met Ala Phe Aap Glu Gin Gly Gin Ala Asp Ser Leu His Arq 

465 47 ° 475 480 

cgc ate gag att tgc age cgc gec tac aaa att etc acc gaa gag gtg 1488 
Arg lie Glu lie Cya Ser Arg Ala Tyr Lys lie Leu Thr Glu 111 Va! 

4 «5 490 495 

ggc ttc ccg ccg gag gac ate ate ttt gac ccg aac gtg ctg acc gtg 1536 
Gly Phe Pro Pro Glu Aap He lie Phe Asp Pro Asn Val Leu Thr Val 
500 505 sio 

!f° ff C 9 ? C atc 9ac 989 cac aac aac tac 9cg etc gac ttc ate gaa is 84 
Ala Thr Gly lie Aap Glu His Asn Asn Tyr 111 Leu Lp Phe lit 111 
515 520 525 

age gtg cgc tgg atc aag cag aac ctg ccg cac gcg aag gtc tec qqc 1632 
Ser Val Arg Trp He Lys Gin Asn Leu Pro His 111 Lyf ?" Ill 111 
530 535 540 

ggc atc age aac gtt teg ttc tec ttc cgc ggc aac gag ccg gtg cgc 1680 
Gly lie Ser Aan Val Ser Phe Ser Phe Arg Gly Asn Glu Pro Val A?g 
545 550 555 560 

gag gcg atg cac acc gcg ttc etc tac cac gec atc cac gee ggt etc 1728 
Glu Ala Met His Thr Ala Phe Leu Tyr His Ala lie His 111 lly Hi 
565 570 S75 

lit f? n? C 9t f SaC 9CC 9CC ca9 ctt " c atc tac 9aa gag atc 1776 
Aap Met Gly lie Val Asn Ala Ala Gin Leu Gly lie Tyr Glu Glu lie 
580 585 590 

gac ccg gag ctt ctt gtc tat gtc gag gac gtg ctg ctg aac cgc cgc 1824 
Aap Pro Glu Leu Leu Val Tyr Val Glu Asp Val Leu Leu Aan Arg Arg 
595 600 60S 

gac gac gee acc gag egg etc gtg gcg ttc get gaa acg atc cgc gac 1872 
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Asp Asp Ala Thr Glu Arg Leu Val Ala Phe Ala Olu Thr He Arq Asd 
610 615 620 

ggc ggc gaa aag gcc gag gcc aag aac gcc gaa tgg cgc aac gcc ccg 1920 
Gly Gly Glu Lys Ala Glu Ala Lys Asn Ala Glu Trp Arg Asn' Ala Pro 
625 «0 635 , 640 

gtc gag gag egg ctg aaa cac gcg etc gtc aag ggc ate gtt gac tac 1968 
Val Glu Glu Arg Leu Lye His Ala Leu Val Lys Gly He Val Asp Tyr 
645 650 655 

ate gac gag gac ace gaa gag gcc cgc cag etc tac ccg agt ccg ctg 2016 
He Asp Glu Asp Thr Glu Glu Ala Arg Gin Leu Tyr Pro Ser Pro Leu 
660 665 670 

gag gtg ate gag ggg ccg etc atg aac ggc atg aac cac gtc ggc gac 2064 
Glu Val He Glu Gly Pro Leu Met Asn Gly Met Asn His Val Gly Asd 
675 680 685 

etc ttc gcc gaa ggc aag atg ttc ctg cca cag gtg gtc aaa age gcc 2112 
Leu Phe Ala Glu Gly Lys Met Phe Leu Pro Gin Val Val Lye Ser Ala 
690 695 700 

cgc gtc atg aag cgc teg gta get gcg ctg att ccc tat ate gag gag 2160 
Arg Val Met Lys Arg Ser Val Ala Ala Leu He Pro Tyr He Glu Glu 
705 710 715 720 

gag aag teg aaa aac tgc gac acg age gcc aaa gcc aag gtg ctg etc 2208 
Glu Lys Ser Lys Asn Cys Asp Thr Ser Ala Lys Ala Lys Val Leu Leu 
725 730 735 

gcc acg gtg aag ggc gac gtg cac gac ate ggc aag aac ate gtg teg 2256 
Ala Thr Val Lys Gly Asp Val His Asp He Gly Lys Asn He Val Ser 
7 «0 745 750 

gtg gtg ctt gcc tgc aac aac ttc gac gtg ate gac ate ggc gtc atg 2304 
Val Val Leu Ala Cys Asn Asn Phe Asp Val He Asp He Gly Val Met 
7 55 760 765 

atg cca tgc gac aag att etc gaa gcg ctg gca gaa cac aag ccc gac 2352 
Met Pro Cys Asp Lys He Leu Glu Ala Leu Ala Glu His Lys Pro Asd 
770 775 780 

gtg etc ggc etc tec ggc etc ate acc ccg teg etc gaa gag atg gcg 2400 
Val Leu Gly Leu Ser Gly Leu He Thr Pro Ser Leu Glu Glu Met Ala 
785 790 795 800 

cac gtg gcc aaa gag atg gag egg etc ggc atg aac att ccg etc ate 2448 
His Val Ala Lys Glu Met Glu Arg Leu Gly Met Asn He Pro Leu He 
805 810 815 

ate ggc ggc gcg acc acc teg aag gtg cac acg gcg gtg aaa etc gcg 2496 
He Gly Gly Ala Thr Thr Ser Lys Val His Thr Ala Val Lys Leu Ala 
820 825 830 

ccc tgc tac ccc age ggc gcg gta gta cac gtg etc gac gcc teg cgc 2544 
Pro Cys Tyr Pro Ser Gly Ala Val Val His Val Leu Asp Ala Ser Arg 
835 840 845 

age gtg ccg gtg gtc age aac etc tgc aac ccc gcc cag cgc gac age 2592 
Ser Val Pro Val Val Ser Asn Leu Cys Asn Pro Ala Gin Arg Asp Ser 
850 855 860 
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tat ate gcg gcg ctg aag gat gag cag gag gcg atg cgc aag age cac 2640 
Tyr He Ala Ala Leu Lye Asp Glu Gin Glu Ala Met Arg Lye Ser H1b 
865 870 875 680 

gee gag cgc atg gcg gca aaa aag tac gtc teg etc gac gec gee cgc 2688 
Ala Glu Arg Met Ala Ala Lys Lys Tyr Val Ser Leu Asp Ala Ala Arg 
885 890 895 

gac aac cgc etc ace att gac tgg gag gec gaa ace ate gac aag ccc 2736 
Asp Asn Arg Leu Thr He Asp Trp Glu Ala Glu Thr He Asp Lys Pro 
900 905 910 

gee cag act ggc gtc ace gtg ctg gag gat gtc ace gtc ggc gcg etc 2784 
Ala Gin Thr Gly Val Thr Val Leu Glu Asp Val Thr Val Gly Ala Leu 
915 920 925 

cgc ccg tat ate gac tgg gca mcc ttc ttc tgg age tgg gag ctg cac 2832 
Arg Pro Tyr He Asp Trp Ala Xaa Phe Phe Trp Ser Trp Glu Leu His 
930 935 940 

ggc gtc tat ccg cag att ctg gag gat gaa aag gtc ggc gag gag gca 2880 
Gly Val Tyr Pro Gin He Leu Glu Asp Glu Lys Val Gly Glu Glu Ala 
945 950 955 960 

ace aaa etc ttc aac gac gee ace get ctg etc gac egg ate gac age 2928 
Thr Lys Leu Phe Asn Asp Ala Thr Ala Leu Leu Asp Arg He Asp Ser 
965 970 975 

gaa aag ctg etc ggc ate aaa ggc gtg gcg ggc ate ttc ccg gee aac 2976 
Glu Lys Leu Leu Gly He Lys Gly Val Ala Gly He Phe Pro Ala Asn 
980 985 990 

age ate ggc gac gac ate ttc gtc tat gcg gat gac gag cgc teg ata 3024 
Ser He Gly Asp Asp He Phe Val Tyr Ala Asp Asp Glu Arg Ser He 
995 1000 1005 

ate cgc acc gtg ctg cac ace ctg cgc cag caa ggc gaa aag cac ggc 3072 
He Arg Thr Val Leu His Thr Leu Arg Gin Gin Gly Glu Lys His Gly 
1010 1015 1020 

gaa gcg aac etc gcg ctg gcg gac ttc gtg gec ccg cgc gaa age ggc 3120 
Glu Ala Asn Leu Ala Leu Ala Asp Phe Val Ala Pro Arg Glu Ser Gly 
102 5 1030 1035 " 1040 

gtc aac gac tgg ate ggc tgc ttc acc gta acc gee gga etc ggc ate 3168 
Val Asn Asp Trp He Gly Cys Phe Thr Val Thr Ala Gly Leu Gly He 
1045 1050 1055 

cag aat ttg etc gac gag ttc aca gca gag aac gac gac tac cac cgc 3216 
Gin Asn Leu Leu Asp Glu Phe Thr Ala Glu Asn Asp Asp Tyr His Arg 
1060 1065 1070 

ate atg aca cag gcg etc gee gac cga ctg gee gaa gcg ttc gca gag 3264 
He Met Thr Gin Ala Leu Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu 
1075 1080 1085 

atg ctg cac gaa aag gtg cgc cgc gaa etc tgg ggc tac gcg ccc ggc 3312 
Met Leu His Glu Lys Val Arg Arg Glu Leu Trp Gly Tyr Ala Pro Gly 
1090 1095 1100 

gaa ate etc ggc aac gaa gag ctg ate gec gaa aag tac cga ggc ate 3360 
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Glu He Leu Gly Asn Glu Glu Leu He Ala Glu Lys Tyr Arg Gly He 
1105 1110 ins H20 

cgc ccc gcc ccc ggc tac ccc gcc tgc ccg gat cac acc gaa aag gca 3408 
Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Asp His Thr Glu Lys Ala 
ll 25 H30 ills 

ate ate ttc gac ctg etc aac get gaa gcg gcc acc ggc gtc acg ctg 3456 
He He Phe ABp Leu Leu Asn Ala Glu Ala Ala Thr Gly Val Thr Leu 
1140 1145, lisp 

acg gaa act ttc gcg atg aac ccc gca gcc tea gtc tgc ggc etc tac 3504 
Thr Glu Thr Phe Ala Met Asn Pro Ala Ala Ser Val Cys Gly Leu Tyr 
1155 H60 1165 

ttc gcc aac ccg gcc teg aaa tac ttc gta etc ggc aag att ggt aag 3552 
Phe Ala ABn Pro Ala Ser Lys Tyr Phe Val Leu Gly Lys He Gly Lys 
1170 1175 H80 

gat cag gtc gaa gac tac gcc aac cgc aaa ggg ctg gaa gta gca gaa 3600 
Asp Gin Val Glu Asp Tyr Ala Asn Arg Lys Gly Leu Glu Val Ala Glu 
ll 85 H90 H95 1200 

gcc gag aag tgg etc gcg ccc teg ctg aac tac gat cca gcg 3642 
Ala Glu Lys Trp Leu Ala Pro Ser Leu Asn Tyr Asp Pro Ala 
1205 1210 



taa 



<210> 34 
<211> 1214 
<212> PRT 

<213> Chlorobium tepiduro 
<220> 

<221> unsure 
<222> 936 936 

<223> All occurrences of Xaa indicate any amino acid 
<400> 34 

Val Leu Asp Gly Ala Met Gly Thr Met He Gin Arg His Gly Leu Asp 
15 io 15 

Glu Gin Asp Tyr Arg Gly Glu Arg Phe Ala Ser His Asp His Pro Leu 
20 25 30 

Lys Gly Asn Asn Asp Leu Leu Val He Thr Arg Pro Asp He He Arg 
35 40 45 

Ser He His Cys Asp Phe Leu Asp Ala Gly Ala Asp He He Glu Thr 
50 55 60 

Cys Thr Phe Asn Ala Asn Pro He Ser Gin Ser Asp Tyr Gin Leu Gin 
65 70 75 80 

Asp Leu Thr Arg Glu Leu Asn Val Ala Ala Ala Lys He Ala Arg Ser 
85 90 95 

Ala Ala Asp Glu Phe Thr Ala Lys Thr Pro Asp Lys Pro Arg Phe Val 
100 105 no 



3645 
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Ala Gly Ser He Gly Pro Thr Asn Lys Thr Leu Ser Leu Ser Pro Asp 
115 120 125 

Val Asn Asn Pro Gly Phe Arg Ala Val Thr Phe Gin Glu Met Val Asp 
130 135 140 

Asn Tyr Thr Ala Gin Leu Glu Gly Leu His Glu Gly Gly Val Asp Leu 
145 150 155 160 

Leu Leu Val Glu Thr Val Phe Asp Thr Leu Asn Cys Lys Ala Ala Leu 
165 170 175 

Tyr Ala He Glu Glu Tyr Ala Val Lys Thr Gly Trp Gin Val Pro Val 
180 185 190 

Met Val Ser Gly Thr Val Val Asp Ala Ser Gly Arg Thr Leu Ser Gly 
195 200 205 

Gin Thr Thr Glu Ala Phe Trp lie Ser He Ser His Met Pro Ser Leu 
210 215 220 

Leu Ser Val Gly Leu Asn Cys Ala Leu Gly Ser Lys Gin Met Arg Pro 
225 230 235 240 

Phe He Glu Ala Leu Ser Asn He Ala Glu Ser Tyr Val Ser Val Tyr 
245 250 255 

Pro Asn Ala Gly Leu Pro Asn Glu Phe Gly Glu Tyr Asp Asp Ser Pro 
260 265 270 

Glu Tyr Met Ala Ala Gin He Ala Gly Phe Ala Glu Ser Gly Phe Val 
275 280 285 

Asn He Val Gly Gly Cys Cys Gly Thr Thr Pro Thr His He Arg Ala 
290 295 300 

He Ala Glu Ala Val Lys Thr Leu Pro Pro Arg Lys Arg Pro Ala Asn 
305 310 315 ~ 320 

Lys His Val Leu Arg Leu Ser Gly Leu Glu Pro Leu Val Val Asp Glu 
325 330 335 

Thr Thr Gly Phe He Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser 
340 345 350 

Arg Lys Phe Ala Arg Leu He Lys Glu Ala Asn Tyr Asp Glu Ala Leu 
355 360 365 

Ser He Ala Arg Gin Gin Val Glu Asn Gly Ala Gin Val He Asp Val 
370 375 380 

Asn Leu Asp Glu Gly Met Leu Asp Ser Glu Lys Val He Val Glu Phe 
385 390 395 400 

Leu Asn Leu He Ala Ser Glu Pro Glu He Ala Lys Val Pro Val Met 
405 410 415 

He Asp Ser Ser Lys Trp Ser Val He Glu Asn Gly Leu Arg Cys Thr 
4 20 425 430 

Gin Gly Lys Ser He Val Asn Ser He Ser Leu Lys Glu Gly Glu Glu 
435 440 445 



WO 03/087386 PCT/EP03/04010 

150 

Leu Phe Lys Glu Arg Ala Arg Lys He Met Gin Tyr Gly Ala Ala Ala 
450 455 460 

Val Val Met Ala Phe Asp Glu Gin Gly Gin Ala Asp Ser Leu Hla Arg 
465 470 475 480 

Arg lie Glu He Cys Ser Arg Ala Tyr Lye He Leu Thr Glu Glu Val 
485 490 495 

Gly Phe Pro Pro Glu Asp He He Phe Asp Pro Asn Val Leu Thr Val 
500 505 510 

Ala Thr Gly He Asp Glu His Asn Asn Tyr Ala Leu Asp Phe He Glu 
515 520 525 

Ser Val Arg Trp He Lys Gin Asn Leu Pro His Ala Lys Val Ser Gly 
530 535 540 

Gly He Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Glu Pro Val Arg 
545 550 555 560 

Glu Ala Met His Thr Ala Phe Leu Tyr His Ala He His Ala Gly Leu 
565 570 575 

Asp Met Gly He Val Asn Ala Ala Gin Leu Gly He Tyr Glu Glu He 
580 585 * 590 

Asp Pro Glu Leu Leu Val Tyr Val Glu Asp Val Leu Leu Asn Arg Arg 
595 600 605 

Asp Asp Ala Thr Glu Arg Leu Val Ala Phe Ala Glu Thr He Arg Asp 
610 615 620 

Gly Gly Glu Lys Ala Glu Ala Lys Asn Ala Glu Trp Arg Asn Ala Pro 
"5 630 635 640 

Val Glu Glu Arg Leu Lys His Ala Leu Val Lys Gly lie Val Asp Tyr 
645 650 655 

He Asp Glu Asp Thr Glu Glu Ala Arg Gin Leu Tyr Pro Ser Pro Leu 
660 665 670 

Glu Val He Glu Gly Pro Leu Met Asn Gly Met Asn His Val Gly Asp 
675 680 685 

Leu Phe Ala Glu Gly Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala 
690 695 700 

Arg Val Met Lys Arg Ser Val Ala Ala Leu He Pro Tyr He Glu Glu 
705 710 715 720 

Glu Lys Ser Lys Asn Cys Asp Thr Ser Ala Lys Ala Lys Val Leu Leu 
725 730 735 

Ala Thr Val Lys Gly Asp Val His Asp He Gly Lys Asn He Val Ser 
740 745 750 

Val Val Leu Ala Cys Asn Asn Phe Asp Val He Asp He Gly Val Met 
755 760 765 

Met Pro Cys Asp Lys He Leu Glu Ala Leu Ala Glu His Lys Pro Asp 



WO 03/087386 PCT/EP03/04010 

151 

770 775 780 

Val Leu Gly Leu Ser Gly Leu He Thr Pro Ser Leu Glu Glu Met Ala 
7 85 790 795 800 

His Val Ala Lys Glu Met Glu Arg Leu Gly Met Asn He Pro Leu He 
805 810 815 

He Gly Gly Ala Thr Thr Ser LyB Val Hie Thr Ala Val Lys Leu Ala 
820 825 830 

Pro Cys Tyr Pro Ser Gly Ala Val Val His Val Leu Asp Ala Ser Arg 
835 840 845 

Ser Val Pro Val Val Ser Asn Leu Cys Asn Pro Ala Gin Arg Asp Ser 
850 855 860 

Tyr He Ala Ala Leu Lys Asp Glu Gin Glu Ala Met Arg Lys Ser His 
8 « 870 875 880 

Ala Glu Arg Met Ala Ala Lys Lys Tyr Val Ser Leu Asp Ala Ala Arg 
885 890 895 

Asp Asn Arg Leu Thr He Asp Trp Glu Ala Glu Thr He Asp Lys Pro 
900 905 910 

Ala Gin Thr Gly Val Thr Val Leu Glu Asp Val Thr Val Gly Ala Leu 
915 920 925 

Arg Pro Tyr He Asp Trp Ala Xaa Phe Phe Trp Ser Trp Glu Leu His 
930 935 940 

Gly Val Tyr Pro Gin He Leu Glu Asp Glu Lys Val Gly Glu Glu Ala 
945 950 955 960 

Thr Lys Leu Phe Asn Asp Ala Thr Ala Leu Leu Asp Arg lie Asp Ser 
965 970 975 

Glu Lys Leu Leu Gly He Lys Gly Val Ala Gly He Phe Pro Ala Asn 
980 985 990 

Ser He Gly Asp Asp He Phe Val Tyr Ala Asp Asp Glu Arg Ser lie 
995 1000 1005 

lie Arg Thr Val Leu His Thr Leu Arg Gin Gin Gly Glu Lys His Gly 
1010 1015 1020 

Glu Ala Asn Leu Ala Leu Ala Asp Phe Val Ala Pro Arg Glu Ser Gly 
1025 1030 1035 1040 

Val Asn Asp Trp He Gly Cys Phe Thr Val Thr Ala Gly Leu Gly lie 
1045 1050 1055 

Gin Asn Leu Leu Asp Glu Phe Thr Ala Glu Asn Asp Asp Tyr His Arg 
1060 1065 1070 

lie Met Thr Gin Ala Leu Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu 
1075 1080 1085 

Met Leu His Glu Lys Val Arg Arg Glu Leu Trp Gly Tyr Ala Pro Gly 
1090 1095 iioo 
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Glu He Leu Gly Asn Glu Glu Leu He Ala Glu Lys Tyr Arg Gly He 
1105 IHO 1115 1120 

Arg Pro Ala Pro Gly Tyr Pro Ala Cye Pro Asp His Thr. Glu Lye Ala 
1125 H30 lias 

He He Phe Asp Leu Leu Asn Ala Glu Ala Ala Thr Gly Val Thr Leu 
H40 H45 1150 

Thr Glu Thr Phe Ala Met Asn Pro Ala Ala Ser Val Cys Gly Leu Tyr 
1155 H60 ii65 

Phe Ala Asn Pro Ala Ser Lys Tyr Phe Val Leu Gly Lys lie Gly Lys 
H70 H75 1100 

Asp Gin Val Glu Asp Tyr Ala Asn Arg Lys Gly Leu Glu Val Ala Glu 
1185 H90 U95 1200 

Ala Glu Lys Trp Leu Ala Pro Ser Leu Asn Tyr Asp Pro Ala 
1205 1210 

<210> 35 
<211> 3777 
<212> DNA 

<213> Delnococcus radiodurans 

<220> 
<221> CDS 
<222> (1) (3774) 
<223> RDR02645 

<400> 35 

atg age cat cac cca gaa gcg teg get tec gee aat ccg tec ate aac 46 
Met Ser His His Pro Glu Ala Ser Ala Ser Ala Asn Pro Ser He Asn 
15 io i 5 

cat caa ccg tec acc ate ace gag gee gec cgc cag cgc ate ctg att 96 
His Gin Pro Ser Thr lie Thr Glu Ala Ala Arg Gin Arg lie Leu He 
20 25 30 

etc gac ggc gee tgg ggt acg cag ctt cag cga gee aac etc acc gaa 144 
Leu Asp Gly Ala Trp Gly Thr Gin Leu Gin Arg Ala Asn Leu Thr Glu 
35 40 45 

gcg gac ttc cgc tgg gac gaa gee gac ccc acg egg atg tac egg ggc 192 
Ala Asp Phe Arg Trp Asp Glu Ala Asp Pro Thr Arg Met Tyr Arg Gly 
50 55 60 



240 



268 



aac ttc gac ctg ctg caa ctg acc aag cct gac gtg att cgc gec gtg 
Asn Phe Asp Leu Leu Gin Leu Thr Lys Pro Asp Val lie Arg Ala Val 
65 70 75 bo 

cac cgc gee tat ttc gag gee gga gcg gac ate gec age acc aat acc 
His Arg Ala Tyr Phe Glu Ala Gly Ala Asp He Ala Ser Thr Asn Thr 
85 90 95 

ttc aac tec acg acc ate teg cag gcg gat tac ggc acc gag gca ctg 336 
Phe Asn Ser Thr Thr lie Ser Gin Ala Asp Tyr Gly Thr Glu Ala Leu 
1°° 105 no 

gee tac gec atg aac cgc gag ggg gca agg ctg gec cgc gaa gtc gec 384 



WO 03/087386 PCT/EP03/04010 

153 

Ala Tyr Ala Met Asn Arg Glu Gly Ala Arg Leu Ala Arg Glu Val Ala 
115 120 12 5 

gac gag ttc gag gcg cgc gac ggc aaa aag cgc tgg gtg gcg ggg agt 432 
Asp Glu Phe Glu Ala Arg Asp Gly Lys Lys Arg Trp Val Ala Gly Ser 
130 135 140 

gtc ggt ccc acc aac cgc acc gcg acc ctt tct ccc gac gtg gag egg 480 
Val Gly Pro Thr Asn Arg Thr Ala Thr Leu Ser Pro Asp Val Glu Arg 
14 5 ISO 155 * 160 

ccc gag ttc cgc aac gtg acc tac gac gac etc gtg gcg gcg tac teg 528 
Pro Glu Phe Arg Asn Val Thr Tyr Asp Asp Leu Val Ala Ala Tyr Ser 
165 170 175 

gag gee ate acc ggg ttg atg gaa ggt ggc gcg gac ctg ctg etc att 576 
Glu Ala He Thr Gly Leu Met Glu Gly Gly Ala Asp Leu Leu Leu He 
180 185 190 

gaa acg gtg ttt gac acg ctg aac gee aaa gec gcg ctg ttt gec gcg 624 
Glu Thr Val Phe Asp Thr Leu Asn Ala Lys Ala Ala Leu Phe Ala Ala 
195 200 205 

cag gac gtg ttc gcg gcg cag ggg cgc gag ctg ccg gtc atg etc teg 672 
Gin Asp Val Phe Ala Ala Gin Gly Arg Glu Leu Pro Val Met Leu Ser 
210 215 220 

ggc acc ate acc gac gec teg ggc cgc acg ctg age ggg cag acg ccc 720 
Gly Thr He Thr Asp Ala Ser Gly Arg Thr Leu Ser Gly Gin Thr Pro 
225 230 235 240 

gaa gee ttc gcg gtg age acc gag cac gee ggc etc ttt teg ctg ggc 768 
Glu Ala Phe Ala Val Ser Thr Glu His Ala Gly Leu Phe Ser Leu Gly 
245 250 255 

ctg aac tgc gcg ctg ggc gee gac ctg ctg egg ccc cac ctg cgc gca 816 
Leu Asn Cys Ala Leu Gly Ala Asp Leu Leu Arg Pro His Leu Arg Ala 
260 265 270 

att gcg gcg aac acg gag gcg ctg gtg teg gtt cac ccc aac gcg ggc 664 
He Ala Ala Asn Thr Glu Ala Leu Val Ser Val His Pro Asn Ala Gly 
275 280 285 

etc ccc aac gee ttc ggg gaa tac gac gaa acg ccc gaa cac acg gcg 912 
Leu Pro Asn Ala Phe Gly Glu Tyr Asp Glu Thr Pro Glu His Thr Ala 
290 295 300 

gcg gtg ctg gec gac ttc gec cgc gag ggg ctg gtc aac ate gtg ggc 960 
Ala Val Leu Ala Asp Phe Ala Arg Glu Gly Leu Val Asn He Val Gly 
305 310 315 320 

ggc tgc tgc ggc acc aca ccc gag cac ate aaa gcg att gcg gag gcg 1008 
Gly Cys Cys Gly Thr Thr Pro Glu His He Lys Ala He Ala Glu Ala 
325 330 335 

gtg aag gac att ccc ccg cgc cag gcg ctg caa ctg ccg cct tac ctg 1056 
Val Lys Asp He Pro Pro Arg Gin Ala Leu Gin Leu Pro Pro Tyr Leu 
340 345 350 

cgc etc age ggc etc gaa gec ttc acc ctg acg ccg gaa acc aac ttc 1104 
Arg Leu Ser Gly Leu Glu Ala Phe Thr Leu Thr Pro Glu Thr Asn Phe 
355 360 365 
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gtc aac gtg ggc gag cgc acc aac gtg acc ggc agt ccc aag ttc age 1152 
Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser Pro Lys Phe Ser 
370 375 380 

aag gcg att ctg gec ggc gac tac gac gca ggg etc aag att gec cgc 1200 
Lys Ala He Leu Ala Gly Asp Tyr Asp Ala Gly Leu Lys lie Ala Arg 
385 3 9° 395 400 

cag cag gtg acg aac ggc gcg caa ate gtg gac ate aac ttc gac gag 1248 
Gin Gin Val Thr Asn Gly Ala Gin He Val Asp He Asn Phe Asp Glu 
405 410 415 

ggg atg etc gac ggc gaa gga gcg atg gtc aag ttc etc aac ctg etc 1296 
Gly Met Leu Asp Gly Glu Gly Ala Met Val Lys Phe Leu Asn Leu Leu 
420 425 430 

gec ggg gag ccg gac ate teg cgc gtg ccc ctg atg etc gac teg tec 1344 
Ala Gly Glu Pro Asp He Ser Arg Val Pro Leu Met Leu Asp Ser Ser 
435 440 445 

aag tgg gag att ctg gaa gcg ggg ctg egg egg gtg cag ggc aag gca 1392 
Lys Trp Glu He Leu Glu Ala Gly Leu Arg Arg Val Gin Gly Lys Ala 
450 455 460 

gtc gtc aac tec ate teg etc aag gac ggc gag gee agg ttt ctg gaa 1440 
Val Val Asn Ser He Ser Leu Lys Asp Gly Glu Ala Arg Phe leu Glu 
465 470 475 480 

cgc gee egg ctg ctg egg cgc tac ggg gcg gcg gcg gtg gtc atg gee 1488 
Arg Ala Arg Leu Leu Arg Arg Tyr Gly Ala Ala Ala Val Val Met Ala 
485 490 495 

ttc gac gaa cag gga cag gee gac aac etc gee cga cgc egg gag att 1536 
Phe Asp Glu Gin Gly Gin Ala Asp Asn Leu Ala Arg Arg Arg Glu He 
500 505 * 510 

ctg ggc cgc gcg tat agg ctg ctg acc gag cag gcg gac ttt ccg ccg 1584 
Leu Gly Arg Ala Tyr Arg Leu Leu Thr Glu Gin Ala Asp Phe Pro Pro 
515 520 525 

cag gac ate att ttc gac ccc aac gtg ctg acc gtt gee acc ggc ate 1632 
Gin Asp He He Phe Asp Pro Asn Val Leu Thr Val Ala Thr Gly He 
530 535 540 



gag gaa cac gac cgc tac gcg ctg gac ttt ate gag gcg acg cgc tag 
Glu Glu His Asp Arg Tyr Ala Leu Asp Phe He Glu Ala Thr Arg Tro 
545 550 555 



1680 



560 



att aaa gaa aac ctg ccg gcg gcg aag gtg teg ggc ggg att tec aac 1728 
He Lys Glu Asn Leu Pro Ala Ala Lys Val Ser Gly Gly He Ser Asn 
565 570 575 

gtc teg ttc age ttc egg ggc aac aac cac gtg cgc gag gcg atg cac 1776 
Val Ser Phe Ser Phe Arg Gly Asn Asn His Val Arg Glu Ala Met His 
580 585 590 

gcg gtg ttt ctg tac cac gee ate cgc gee ggg ctg gac atg ggc ate 1824 
Ala Val Phe Leu Tyr His Ala He Arg Ala Gly Leu Asp Met Gly He 
595 600 605 

gtg aac gcg ggg atg ctg gcg gtg tac gag gac ate gag ccg gag ctg 1872 
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Val Asn Ala Gly Met Leu Ala Val Tyr Glu Asp He Glu Pro Glu Leu 
610 615 6 20 

cgc gag gcc gtc gag gac gtc att ctg get cgc cgt ccg gac gec acc 1920 
Arg Glu Ala Val Glu Asp Val lie Leu Ala Arg Arg Pro Asp Ala Thr 
625 "0 635 640 

gag cgt ttg ctg acg ctg gcc gac cgc tac aag gac ate aag cgc gaa 1968 
Glu Arg Leu Leu Thr Leu Ala Asp Arg Tyr Lys Asp He Lys Arg Glu 

"5 650 655 ( 

agt gcc gcc cag age gcc tgg cgc gac ctg ccg gtg cag gaa egg ctg 2016 
Ser Ala Ala Gin Ser Ala Trp Arg Asp Leu Pro Val Gin Glu Arg Leu 
660 665 670 

egg cac gca ctg gtg cag ggc gtc gcc gac cac gtg gat gag gac gcc 2064 
Arg His Ala Leu Val Gin Gly Val Ala Asp His Val Asp Glu Asp Ala 
675 680 685 

gag gcc gcc tat cag gaa etc ggc age ccg ctg gcc gtc ate gaa ggc 2112 
Glu Ala Ala Tyr Gin Glu Leu Gly Ser Pro Leu Ala Val He Glu Gly 
690 695 700 

ccg ctg atg gac ggc atg aac gtg gtg ggc gac etc ttc ggc gcg ggg 2160 
Pro Leu Met Asp Gly Met Asn Val Val Gly Asp Leu Phe Gly Ala Gly 
705 710 7is 72 J 

aaa atg ttc ctg ccg cag gtg gtc aaa tec gcc cgc gtg atg aaa aag 2208 
Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala Arg Val Met Lys Lys 
? 25 730 735 

^ 3 !f° ta ° ° tC a ° 9 CCC tat ct * 9aa 9 C 9 9 a 9 aa 9 9 C * 9aa age 2256 
Ala Val Ala Tyr Leu Thr Pro Tyr Leu Glu Ala Glu Lys Ala Glu Ser 
740 745 750 

tec age aag ggc aag gta ctg ctg gcg acc gtc aag ggc gat gtg cac 2304 
Ser Ser Lys Gly Lys Val Leu Leu Ala Thr Val Lys Gly Asp Val His 
7 55 760 765 

gac ate ggc aag aac ate gtg ggc gtg gtg etc gcc tgc aac ggc tat 2352 
Asp He Gly Lys Asn He Val Gly Val Val Leu Ala Cys Asn Gly Tyr 
770 775 780 

cag gtg acc gac etc ggc gtg atg gtg ccg ggc gag aag att ctg gac 2400 
Gin Val Thr Asp Leu Gly Val Met Val Pro Gly Glu Lys He Leu Asp 
785 790 795 800 

gaa gcc gag egg etc ggt gcc gac gtg ate ggt ctg age ggg ctg att 2448 
Glu Ala Glu Arg Leu Gly Ala Asp Val He Gly Leu Ser Gly Leu He 
8 05 810 815 

acg cct tec tta gac gaa atg gtg aac gtg gcc cgc gag atg acg cgc 2496 
Thr Pro Ser Leu Asp Glu Met Val Asn Val Ala Arg Glu Met Thr Arg 
820 825 830 

egg ggc gtg aaa act cca ctg ctg ate ggc ggc gcg acg acc age egg 2544 
Arg Gly Val Lys Thr Pro Leu Leu He Gly Gly Ala Thr Thr Ser Arg 
835 840 845 

gcg cac acg gcg gtc aag att gac ccg gcc tac gac ggg acg gta gtg 2592 
Ala His Thr Ala Val Lys He Asp Pro Ala Tyr Asp Gly Thr Val Val 
850 855 860 
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cac gtg ctg gac gcc age cgc gec gtg acc gtg acc aac gac ctg ctg 2640 
Hie Val Leu Asp Ala Ser Arg Ala Val Thr Val Thr Asn Asp Leu Leu 
865 370 875 880 

acc gac gag gcc gcc tac get ggg cgc gtg cag ggc gag tat gac acc 2688 
Thr Asp Glu Ala Ala Tyr Ala Gly Arg Val Gin Gly Glu Tyr Asp Thr 
885 890 095 

ttg cgc gag cgc cac ggc gag egg cag gtg egg ctg att gcg ctg gca 2736 
Leu Arg Glu Arg His Gly Glu Arg Gin Val Arg Leu lie Ala Leu Ala 
500 905 910 

gaa gcc cgc gcc cgc gcc ccg caa ctg agt gcc gcc gtg ccc ccc gcg 2784 
Glu Ala Arg Ala Arg Ala Pro Gin Leu Ser Ala Ala Val Pro Pro Ala 
915 920 925 



ccg cac gat ctg ggc cgt cag gtg gtc gaa cag ccc att gcc gag ctg 
Pro His Asp Leu Gly Arg Gin Val Val Glu Gin Pro He Ala Glu Leu 
530 935 940 



2832 



ctg ccc ttc ate gac tgg acg ccc ttt ttc ate gcc tgg gag atg aag 2880 
Leu Pro Phe He Asp Trp Thr Pro Phe Phe He Ala Trp Glu Met Lye 
945 950 955 960 

ggc ate tac ccg ggc ate ctg acc gac cct ctg cgt ggc gag gag gcc 2928 
Gly He Tyr Pro Gly He Leu Thr Asp Pro Leu Arg Gly Glu Glu Ala 
965 970 " 975 

cgc aag ctg ttt gcc gac gcg cag gcg ctg ctg gag cag gtt ate gcc 2976 
Arg Lys Leu Phe Ala Asp Ala Gin Ala Leu Leu Glu Gin Val He Ala 
980 985 990 

gac ggc teg ctg egg gcg cgc ggc gtc ate ggg ctg tgg ccc gcg cac 3024 
Asp Gly Ser Leu Arg Ala Arg Gly Val He Gly Leu Trp Pro Ala H1b 
995 1000 1005 

ggc gac gac ate gtg ctg gac gat gcg gcg atg ggg cgt ggc gag acg 3072 
Gly Asp Asp He Val Leu Asp Asp Ala Ala Met Gly Arg Gly Glu Thr 
1010 1015 1020 

ctg gat ttc gag acg cac gaa etc gcc gcc ggg cgc gag ccg ctg ccg 3120 
Leu Asp Phe Glu Thr His Glu Leu Ala Ala Gly Arg Glu Pro Leu Pro 
1025 1030 1035 1040 

aac atg ccg cgc ctg cac acg ctg egg cag cag cgc gac cag acc acg 3168 
Asn Met Pro Arg Leu His Thr Leu Arg Gin Gin Arg Asp Gin Thr Thr 
1045 1050 1055 

ccg aac act gcg ctg get gac ttt gtg gcg gaa gga ggc gac cac ate 3216 
Pro Asn Thr Ala Leu Ala Asp Phe Val Ala Glu Gly Gly Asp His He 
1060 1065 1070 

ggc gcc ttc gcc acg gcc ate ttc ggc gcc gag gag ttg gcg cag cag 3264 
Gly Ala Phe Ala Thr Ala He Phe Gly Ala Glu Glu Leu Ala Gin Gin 
1075 1080 1085 

ttc gag gcg cag cac gac gac tac aac teg att ctg gtc aag gcg gtg 3312 
Phe Glu Ala Gin His Asp Asp Tyr Asn Ser He Leu Val Lys Ala Val 
1090 1095 lioo 

gcc gac cga ctg gcc gag gcc ttt gcc gag aag ctg cac cgc gac gtg 3360 
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Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu Lyo Leu His Arg Asp Val 
1105 mo i 115 ^ U20 

cgc gtg egg cac tgg ggt tac gec gag ggc gag gcg etc gac aac ace 3408 
Arg val Arg Hie Trp Gly Tyr Ala Glu Gly Glu Ala Leu Asp Asn Thr 
U25 1130 H35 

gac etc ate aag gag cgc tat cag ggc ate cgc cct gcg ccc ggc tac 3456 
Asp Leu lie Lya Glu Arg Tyr Gin Gly He Arg Pro Ala Pro Gly Tyr 
1140 H45 H50 

ccc gcg cag ccc gac cac acc gag aaa cgc acc ctg ttt gag ctg ctg 3504 
Pro Ala Gin Pro Asp His Thr Glu Lys Arg Thr Leu Phe Glu Leu Leu 
1155 H60 1165 

gac gcg gaa age ate ggc ctg cgc etc acc gag teg tgt gee atg acc 3552 
Asp Ala Glu Ser He Gly Leu Arg Leu Thr Glu Ser Cys Ala Met Thr 
1170 H75 liao 

ccg gcg gcg gcg gtg teg ggg ctg tac ttc gcg cat ccg gag gee cgt 3600 
Pro Ala Ala Ala Val Ser Gly Leu Tyr Phe Ala His Pro Glu Ala Arg 
1185 H50 H95 1200 

tat ttc gca gtg ggc cgc ate ggg cgc gac cag gtg gag aac tac gec 3646 
Tyr Phe Ala Val Gly Arg He Gly Arg Asp Gin Val Glu Asn Tyr Ala 
1205 1210 1215 

gec cgt aag ggt tgg act gtg cag gaa gee gag cgc tgg ctg ggg ccg 3696 
Ala Arg Lys Gly Trp Thr Val Gin Glu Ala Glu Arg Trp Leu Gly Pro 
1220 1225 1230 

ctg ctg gcg tac age gec ggg ccg ggg cca gaa gca age cag aaa gee 3744 
Leu Leu Ala Tyr Ser Ala Gly Pro Gly Pro Glu Ala Ser Gin Lys Ala 
1235 1240 1245 

etc ggc gca gag ctg aca gga gcg caa teg tga 3777 
Leu Gly Ala Glu Leu Thr Gly Ala Gin Ser 
1250 1255 

<210> 36 
<211> 1258 
<212> PRT 

<213> Deinococcus radiodurans 
<400> 36 

Met Ser His His Pro Glu Ala Ser Ala Ser Ala Asn Pro Ser He Asn 
15 10 15 

His Gin Pro Ser Thr He Thr Glu Ala Ala Arg Gin Arg He Leu lie 
20 25 30 

Leu Asp Gly Ala Trp Gly Thr Gin Leu Gin Arg Ala Asn Leu Thr Glu 
35 40 45 

Ala Asp Phe Arg Trp Asp Glu Ala Asp Pro Thr Arg Met Tyr Arg Gly 
50 55 60 

Asn Phe Asp Leu Leu Gin Leu Thr Lys Pro Asp Val He Arg Ala Val 
65 70 75 80 

His Arg Ala Tyr Phe Glu Ala Gly Ala Asp He Ala Ser Thr Asn Thr 
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85 90 95 

Phe Asn Ser Thr Thr He Ser Oln Ala Asp Tyr Qly Thr Glu Ala Leu 
100 105 

Ala Tyr Ala Met Asn Arg Glu Gly Ala Arg Leu Ala Arg Glu Val Ala 
115 120 125 

Asp Glu Phe Glu Ala Arg Asp Gly Lys Lys Arg Trp Val Ala Gly Ser 
130 135 140 

Val Gly Pro Thr Asn Arg Thr Ala Thr Leu Ser Pro Asp Val Glu Arg 
145 150 155 160 

Pro Glu Phe Arg Asn Val Thr Tyr Asp Asp Leu Val Ala Ala Tyr Ser 
165 170 175 

Glu Ala lie Thr Gly Leu Met Glu Gly Gly Ala Asp Leu Leu Leu He 
180 185 190 

Glu Thr Val Phe Asp Thr Leu Asn Ala Lys Ala Ala Leu Phe Ala Ala 
195 200 205 

Gin Asp Val Phe Ala Ala Gin Gly Arg Glu Leu Pro Val Met Leu Ser 
210 215 220 

Gly Thr He Thr Ab P Ala Ser Gly Arg Thr Leu Ser Gly Gin Thr Pro 
225 230 235 240 

Glu Ala Phe Ala Val Ser Thr Glu His Ala Gly Leu Phe Ser Leu Gly 
245 250 255 

Leu Asn Cys Ala Leu Gly Ala Asp Leu Leu Arg Pro His Leu Arg Ala 
260 265 270 

He Ala Ala Asn Thr Glu Ala Leu Val Ser Val His Pro Asn Ala Gly 
275 280 285 

Leu Pro Asn Ala Phe Gly Glu Tyr Asp Glu Thr Pro Glu His Thr Ala 
290 295 300 

Ala Val Leu Ala Asp Phe Ala Arg Glu Gly Leu Val Asn He Val Gly 
305 310 315 320 

Gly Cys Cys Gly Thr Thr Pro Glu His He Lys Ala He Ala Glu Ala 
325 330 335 

Val Lys Asp He Pro Pro Arg Gin Ala Leu Gin Leu Pro Pro Tyr Leu 
340 345 350 

Arg Leu Ser Gly Leu Glu Ala Phe Thr Leu Thr Pro Glu Thr Asn Phe 
355 360 365 

Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser Pro Lys Phe Ser 
370 375 380 

Lys Ala He Leu Ala Gly Asp Tyr Asp Ala Gly Leu Lys He Ala Arg 
385 390 395 400 

Gin Gin Val Thr Asn Gly Ala Gin He Val Asp He Asn Phe Asp Glu 
405 410 4 15 
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Gly Met Leu Asp Gly Glu Gly Ala Met Val Lys Phe Leu Asn Leu Leu 
420 425 430 

Ala Gly Glu Pro Asp He Ser Arg Val Pro Leu Met Leu Asp Ser Ser 
435 440 445 

Lys Trp Glu He Leu Glu Ala Gly Leu Arg Arg Val Gin Gly Lys Ala 
450 455 460 

Val Val Asn Ser He Ser Leu Lys Asp Gly Glu Ala Arg Phe Leu Glu 
465 470 475 480 

Arg Ala Arg Leu Leu Arg Arg Tyr Gly Ala Ala Ala Val Val Met Ala 
485 490 495 

Phe Abp Glu Gin Gly Gin Ala Asp Asn Leu Ala Arg Arg Arg Glu He 
500 505 sio 

Leu Gly Arg Ala Tyr Arg Leu Leu Thr Glu Gin Ala Asp Phe Pro Pro 
515 520 525 

Gin Asp He He Phe Asp Pro Asn Val Leu Thr Val Ala Thr Gly He 
530 535 540 

Glu Glu His Asp Arg Tyr Ala Leu Asp Phe He Glu Ala Thr Arg Trp 
545 550 555 560 

He Lys Glu Asn Leu Pro Ala Ala Lys Val Ser Gly Gly He Ser Asn 
565 570 575 

Val Ser Phe Ser Phe Arg Gly Asn Asn His Val Arg Glu Ala Met His 
580 585 590 

Ala Val Phe Leu Tyr His Ala He Arg Ala Gly Leu Asp Met Gly He 
595 600 605 

Val Asn Ala Gly Met Leu Ala Val Tyr Glu Asp He Glu Pro Glu Leu 
610 615 620 

Arg Glu Ala Val Glu Asp Val He Leu Ala Arg Arg Pro Ab P Ala Thr 
625 630 635 640 

Glu Arg Leu Leu Thr Leu Ala Asp Arg Tyr Lys Asp He Lys Arg Glu 
645 650 655 

Ser Ala Ala Gin Ser Ala Trp Arg Asp Leu Pro Val Gin Glu Arg Leu 
660 665 670 

Arg His Ala Leu Val Gin Gly Val Ala Asp His Val Asp Glu Asp Ala 
675 680 685 

Glu Ala Ala Tyr Gin Glu Leu Gly Ser Pro Leu Ala Val He Glu Gly 
690 695 700 

Pro Leu Met Asp Gly Met Asn Val Val Gly Asp Leu Phe Gly Ala Gly 
705 710 715 720 

Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala Arg Val Met Lys Lys 
725 730 735 

Ala Val Ala Tyr Leu Thr Pro Tyr Leu Glu Ala Glu Lys Ala Glu Ser 
740 745 750 
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Ser Ser Lys Gly Lys Val Leu Leu Ala Thr Val Lys Gly Asp Val His 
755 760 765 

Asp He Gly Lys Asn He Val Gly Val Val Leu Ala Cys Asn Gly Tyr 
770 775 780 . 

Gin Val Thr Asp Leu Gly Val Met Val Pro Gly Glu Lys He Leu Asp 
785 790 795 800 

Glu Ala Glu Arg Leu Gly Ala Asp Val lie Gly Leu Ser Gly Leu He 
805 810 815 

Thr Pro Ser Leu Asp Glu Met Val Asn Val Ala Arg Glu Met Thr Arg 
820 825 ^ 830 

Arg Gly Val Lys Thr Pro Leu Leu He Gly Gly Ala Thr Thr Ser Arg 
835 840 845 

Ala His Thr Ala Val Lys He Asp Pro Ala Tyr Asp Gly Thr Val Val 
850 855 860 

His Val Leu Asp Ala Ser Arg Ala Val Thr Val Thr Asn Asp Leu Leu 
865 870 875 880 

Thr Asp Glu Ala Ala Tyr Ala Gly Arg Val Gin Gly Glu Tyr Asp Thr 
885 890 * 895 

Leu Arg Glu Arg His Gly Glu Arg Gin Val Arg Leu He Ala Leu Ala 
900 90S 910 

Glu Ala Arg Ala Arg Ala Pro Gin Leu Ser Ala Ala Val Pro Pro Ala 
915 920 925 

Pro His Asp Leu Gly Arg Gin Val Val Glu Gin Pro He Ala Glu Leu 
930 935 940 

Leu Pro Phe He Asp Trp Thr Pro Phe Phe He Ala Trp Glu Met LyB 
945 950 955 960 

Gly He Tyr Pro Gly He Leu Thr Asp Pro Leu Arg Gly Glu Glu Ala 
965 970 * 975 

Arg Lys Leu Phe Ala Asp Ala Gin Ala Leu Leu Glu Gin Val He Ala 
980 985 990 

Asp Gly Ser Leu Arg Ala Arg Gly Val He Gly Leu Trp Pro Ala His 
995 1000 1005 

Gly Asp Asp He Val Leu Asp Asp Ala Ala Met Gly Arg Gly Glu Thr 
1010 1015 1020 

Leu Asp Phe Glu Thr His Glu Leu Ala Ala Gly Arg Glu Pro Leu Pro 
1025 1030 1035 1040 

Asn Met Pro Arg Leu His Thr Leu Arg Gin Gin Arg Asp Gin Thr Thr 
1045 1050 1055 

Pro Asn Thr Ala Leu Ala Asp Phe Val Ala Glu Gly Gly Asp HiB He 
1060 1065 ' 1070 

Gly Ala Phe Ala Thr Ala He Phe Gly Ala Glu Glu Leu Ala Gin Gin 
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1075 1080 1085 

Phe Glu Ala Gin His Asp Asp Tyr Asn Ser He Leu Val Lys Ala Val 
1090 1095 lioo 

Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu Lys Leu His Arg Asp Val 
"OS 1110 ins 1120 

Arg Val Arg His Trp Gly Tyr Ala Glu Gly Glu Ala Leu Asp Asn Thr 
1125 mo H35 

Asp Leu He Lys Glu Arg Tyr Gin Gly He Arg Pro Ala Pro Gly Tyr 
1140 H45 1150 

Pro Ala Gin Pro Asp His Thr Glu Lys Arg Thr Leu Phe Glu Leu Leu 
1155 H60 ^ H65 

Asp Ala Glu Ser He Gly Leu Arg Leu Thr Glu Ser Cys Ala Met Thr 
1170 1175 H80 

Pro Ala Ala Ala Val Ser Gly Leu Tyr Phe Ala His Pro Glu Ala Arg 
H85 1190 1195 1200 

Tyr Phe Ala Val Gly Arg He Gly Arg Asp Gin Val Glu Asn Tyr Ala 
1205 1210 1215 

Ala Arg Lys Gly Trp Thr Val Gin Glu Ala Glu Arg Trp Leu Gly Pro 
1220 1225 1230 

Leu Leu Ala Tyr Ser Ala Gly Pro Gly Pro Glu Ala Ser Gin Lys Ala 
1235 1240 1245 

Leu Gly Ala Glu Leu Thr Gly Ala Gin Ser 
1250 1255 



<210> 37 
<211> 3642 
<212> DNA 

<213> Clostridium acetobutylicum 

<220> 

<221> CDS 

<222> (1)..(3639) 

<223> RCA01265 

<400> 37 

ctt atg aat tct tea eta aag aat ttg tta aat aac aaa att tta gtt 48 

Leu Met Asn Ser Ser Leu Lys Asn Leu Leu Asn Asn Lys He Leu Val 
1 5 io ' is 



tta gat ggt get atg gga aca tgt att caa tec ttt aat eta gat gaa 
Leu Asp Gly Ala Met Gly Thr Cys He Gin Ser Phe Asn Leu Asp Glu 
20 25 30 



96 



ggc gac ttt aaa ggt tec tta tct tgt aca tgt cat tec aat caa aaa 144 
Gly Asp Phe Lys Gly Ser Leu Ser Cys Thr Cys His Ser ABn Gin Lys 
35 40 45 



gga aac aat gat gtt tta aat tta acc aag cca gaa ata ata aaa gaa 192 
Gly Asn Asn Asp Val Leu Asn Leu Thr Lys Pro Glu He He Lys Glu 
50 55 60 
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260 265 



270 



240 



288 



ate cac aag aga tac ctt gaa get ggc gca gat ata ata gaa aca aac 
He Hie Lys Arg Tyr Leu Glu Ala Gly Ala Asp He He Glu Thr Asn 
65 ( 70 75 .80 

act ttt aac get act gaa ata tea caa aaa gat tat aat atg caa gat 
Thr Phe Asn Ala Thr Glu He Ser Gin Lys Asp Tyr Asn Met Gin Asp 
85 90 95 

aaa ata tat gat att aat ttt aag ggg. gca aaa etc gca aag gaa get 336 
Lya He Tyr Aap He Asn Phe Lys Gly Ala Lys Leu Ala Lys Glu Ala 
100 105 no 

tgt act tac tac aca aaa eta aat cct aat aag cct aga ttt get get 384 
Cys Thr Tyr Tyr Thr Lys Leu Asn Pro Asn Lys Pro Arg Phe Ala Ala 
115 120 125 

ggt tct att ggg cct aca aat aga act get tct eta tct cca gat gtt 432 
Gly ser He Gly Pro Thr Asn Arg Thr Ala Ser Leu Ser Pro Asp Val 
130 135 140 

gaa aat cct ggt ttt aga aat gta acc ttt gat gag eta tgt aat gee 
Glu Asn Pro Gly Phe Arg Asn Val Thr Phe Asp Glu Leu Cys Asn Ala 
145 150 155 160 

tat aaa cat caa ata gag get eta ata gat gga ggt gta gac ctt ctt 
Tyr Lys His Gin He Glu Ala Leu He Asp Gly Gly Val Asp Leu Leu 
165 170 175 

tta att gaa act ata ttt gat act tta aac get aga gca gca ate ttt 
Leu He Glu Thr He Phe Asp Thr Leu Asn Ala Arg Ala Ala He Phe 
I 80 185 iso 

2f a 9ta ttfc 9aa aat aaa aaa ata aaa ctt cct *tt ata 624 
Ala Ala Glu Thr Val Phe Glu Asn Lys Lys He Lys Leu Pro He He 

I 95 200 205 

att tea ggg aca ata get gat aaa agt gga aga ata tta tec ggt caa 672 
He Ser Gly Thr He Ala Asp Lys Ser Gly Arg He Leu Ser Gly Gin 
2X0 215 220 

act ctt gac get ttt gca gaa agt tta aaa aac gaa aat ata att get 
Thr Leu Asp Ala Phe Ala Glu Ser Leu Lye Asn Glu Asn He He Ala 
225 230 235 240 

ata ggg ctt aat tgt tec ttt ggt get gaa gaa ctt ata cct ttt ata 
He Gly Leu Asn Cys Ser Phe Gly Ala Glu Glu Leu He Pro Phe He 
245 250 255 

aaa aga etc tct gaa aca caa aat aga tat ata tec ttt cat cca aac 
Lys Arg Leu Ser Glu Thr Gin Asn Arg Tyr He Ser Phe His Pro Asn 



480 



528 



576 



720 



768 



816 



gca gga ctt cca aac tec ctt ggt gaa tat gaa gaa ctg cca gag gaa 864 
Ala Gly Leu Pro Asn Ser Leu Gly Glu Tyr Glu Glu Leu Pro Glu Glu 
275 280 285 

act get age att gta aaa aaa tta gca ctt gaa gga cat tta aat ata 912 
Thr Ala Ser He Val Lys Lys Leu Ala Leu Glu Gly Hie Leu Asn He 
290 295 300 



gtt gga ggc tgc tgt ggc act aca cca gaa cat ata aga gca ata age 



960 
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Val Gly Gly Cys Cye Gly Thr Thr Pro Glu His He Arg Ala lie Ser 
305 310 315 320 

age gta gtt aaa ggc att tct cca aga aaa gtt cca aac ttg gaa ccc 
Ser Val Val Lys Gly He Ser Pro Arg Lys Val Pro Asn Leu Glu Pro 
325 330 335 

aaa aca att tac age gga eta gaa aac ata aaa att gat aag aac agt 
Lys Thr He Tyr Ser Gly Leu Glu Asn He Lys He Asp Lys Asn Ser 
340 345 350 



1008 



1056 



aac ttc ata aat ata ggc gaa aga aca aat gta gcg ggc tea aga aaa 1104 
Asn Phe He Asn He Gly Glu Arg Thr Asn Val Ala Gly Ser Arg Lvs 
355 360 365 

ttc gca agg ctt ata cgt gaa aaa aat tat gag gag get eta acc att 1152 
Phe Ala Arg Leu He Arg Glu Lys Asn Tyr Glu Glu Ala Leu Thr He 
370 375 380 

gca aga cat cag gtt gaa aat ggt gec caa att ata gat ata aat ttt 1200 
Ala Arg His Gin Val Glu Asn Gly Ala Gin He He Asp He Asn Phe 
385 390 395 400 

gat gat gca ctt tta gat get cgc tct gaa atg gaa aca ttt tta aga 1248 
Asp Asp Ala Leu Leu Asp Ala Arg Ser Glu Met Glu Thr Phe Leu Arg 
405 410 415 

ctt att gca agt gaa cct gaa ata tea aaa gtt cca gtt atg ata gac 1296 
Leu He Ala Ser Glu Pro Glu He Ser Lys Val Pro Val Met He Asp 
420 425 430 

tec tct aat ttt gaa gtt tta aaa gtt gga tta aag tct att caa ggt 1344 
Ser Ser Asn Phe Glu Val Leu Lys Val Gly Leu Lys Ser He Gin Gly 
435 440 445 

aaa gee ata gta aat tec ata agt ctt aag gtt gga gaa gaa aag ttc 1392 
Lys Ala He Val Asn Ser He Ser Leu Lys Val Gly Glu Glu Lys Phe 
450 455 460 

att gaa gag gca aaa ttt ata aag aac ttt ggc get ggc gta gtt gta 1440 
He Glu Glu Ala Lys Phe He Lys Asn Phe Gly Ala Gly Val Val Val 
465 470 475 480 

atg gee ttt gac gaa gaa ggt caa gca get act tat gaa aga aaa att 1488 
Met Ala Phe Asp Glu Glu Gly Gin Ala Ala Thr Tyr Glu Arg Lys He 
485 490 495 

gaa ate tgc aag aga get tat act att etc aca gaa aaa gtt gag ttt 1536 
Glu He Cys Lys Arg Ala Tyr Thr lie Leu Thr Glu Lys Val Glu Phe 
500 505 510 

cca cct gaa aat ata ata ttt gat cca aat ata eta tct ata gcg aca 1584 
Pro Pro Glu Asn He He Phe Asp Pro Asn He Leu Ser lie Ala Thr 
515 520 525 

gga att gaa gaa cat gac aac tat gca gtt aat tac ata aaa get gtt 1632 
Gly He Glu Glu His Asp Asn Tyr Ala Val Asn Tyr lie Lys Ala Val 
530 535 540 



aaa tgg ata aaa gag aat eta cca tac get aaa gtc age ggt gga gtt 
Lys Trp He Lys Glu Asn Leu Pro Tyr Ala Lys Val Ser Gly Gly Val 
545 550 555 560 



1680 
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age aac etc tec ttt tct ttt agg ggt aat gac gca ata aga aga get 1728 
Ser Asn Leu Ser Phe Ser Phe Arg Gly Asn Asp Ala He Arg Arg Ala 
565 570 , 57S 

atg cat tct gtt ttc ctt tac cat gca ata aac get gga atg gat atg 1776 
Met Hie Ser Val Phe Leu Tyr His Ala He Asn Ala Gly Met Asp Met 
580 585 590 

ggt att gtt aat cca gca atg att gat tta tat gac gat ata, gat aag 1824 
Gly He Val Asn Pro Ala Met He Asp Leu Tyr Asp Asp lie Asp Lys 
595 600 605 

gat ctt etc gaa aag gtt gag aat gtt gta eta aat aaa tea tct aac 1872 
Asp Leu Leu Glu Lys Val Glu Asn Val Val Leu Asn Lys Ser Ser Asn 
610 615 620 



get tct gaa tea tta eta gaa ttt get caa acg tat aaa aag acg act 
Ala Ser Glu Ser Leu Leu Glu Phe Ala Gin Thr Tyr Lys Lys Thr Thr 
«5 630 635 640 



99a gag gga aaa atg ttt ctt cct caa gta gta aaa agt get aga gtt 
Gly Glu Gly Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala Arg Val 
705 710 715 



720 



1920 



gaa acc tta gaa aag cac gag gat gaa tgg cga caa aaa age cca agt 1968 
Glu Thr Leu Glu Lys His Glu Asp Glu Trp Arg Gin Lys Ser Pro Ser 
645 650 655 

gaa agg ttg agt tat get tta gtt aaa gga aat gtt gaa ttt att gaa 2016 
Glu Arg Leu Ser Tyr Ala Leu Val Lys Gly Asn Val Glu Phe He Glu 
660 665 670 

gaa gat ata gaa gaa gca aga aaa gag tat aca aat gca ctt gaa att 2064 
Glu Asp He Glu Glu Ala Arg Lys Glu Tyr Thr Asn Ala Leu Glu He 
675 680 685 

ata gag gtt cct tta atg aat gga atg aaa aaa gtg ggt aaa ctt ttt 2112 
He Glu Val Pro Leu Met Asn Gly Met Lys Lys Val Gly Lys Leu Phe 
690 695 700 



2160 



atg aaa aag get gtt gaa tgt ctt ctt ccc tat ata aac gaa gaa aag 2208 
Met Lys Lys Ala Val Glu Cys Leu Leu Pro Tyr He Asn Glu Glu Lys 
725 730 735 

tct aaa aat cac aat aaa agt get ggt aag gtt gta ttt gca act gtt 2256 
Ser Lys Asn His Asn Lys Ser Ala Gly Lys Val Val Phe Ala Thr Val 
740 745 750 

aaa ggc gat gtt cat gac ata ggc aaa aat ate gta tct gta gtt ctt 2304 
Lys Gly Asp Val His Asp He Gly Lys Asn lie Val Ser Val Val Leu 
755 760 765 

tec tgc aac aat ttt gaa gtt ata gat tta gga gta atg gtt ccc cct 2352 
Ser Cys Asn Asn Phe Glu Val He Asp Leu Gly Val Met Val Pro Pro 
770 775 780 

gaa acc ata ctt gaa acg gca aaa cgt gaa aat gca gat ate att get 2400 
Glu Thr He Leu Glu Thr Ala Lys Arg Glu Asn Ala Asp He He Ala 
785 790 795 800 

tta agt ggt tta att aca cct tct ctt aat gaa atg get tat gta get 2448 



WO 03/087386 



PCT/EP03/04010 



165 

Leu Ser Gly Leu He Thr Pro Ser Leu Asn Glu Met Ala Tyr Val Ala 
805 810 815 

gaa gaa atg aaa agg ctt aat ttt gat ata cca ctt atg gtg ggt ggt 2496 
Glu Glu Met Lys Arg Leu Asn Phe Asp He Pro Leu Met Val Gly Gly 
820 825 830 

get get acc tea aaa act cac aca get tta aaa eta get acg aaa tat 2544 
Ala Ala Thr Ser Lys Thr His Thr Ala Leu Lys Leu Ala Thr Lys Tyr 
835 840 845 

aaa tat gta gta cac agt act gat get tea gat get gtt acc gta gee 2592 
Lys Tyr Val Val His Ser Thr Asp Ala Ser Asp Ala Val Thr Val Ala 
850 855 860 



aaa aat eta atg agt gaa aac aaa ttt act ttc tta gaa aaa tta aat 2640 
Lys Asn Leu Met Ser Glu Asn Lys Phe Thr Phe Leu Glu Lys Leu Asn 
865 870 875 * 880 

gaa gag tat tct aaa ata aga gag acc ttc tct act aat aag att gaa 2688 
Glu Glu Tyr Ser Lye He Arg Glu Thr Phe Ser Thr Asn Lys He Glu 
885 690 895 

ctt ate tec att caa aac gca aga aaa aac aga ttt act att gac tgg 2736 
Leu He Ser He Gin Asn Ala Arg Lys Asn Arg Phe Thr He Asp Trp 
900 905 ~ 910 

aat aaa act aaa ata act gaa cct aaa ttt gtc ggt ata aaa aaa tta 2784 
Asn Lys Thr Lys He Thr Glu Pro Lys Phe Val Gly He Lys Lys Leu 
915 920 925 

caa get gta cct ata aat gaa tta aga aag tat ata gat tgg act ttc 2832 
Gin Ala Val Pro He Asn Glu Leu Arg Lys Tyr He Asp Trp Thr Phe 
930 935 ~ 940 

ttc ttt acg tct tgg gat atg gga atg aat tac ccc aaa ata atg aaa 2880 
Phe Phe Thr Ser Trp Asp Met Gly Met Asn Tyr Pro Lys He Met Lys 
945 950 955 960 

gat cct aaa tac gga get gaa get caa aaa etc ttt aag gat gee aat 2928 
Asp Pro Lys Tyr Gly Ala Glu Ala Gin Lys Leu Phe Lye Asp Ala Asn 
965 970 975 

gaa atg ctt gat tta ttg caa aaa gaa aat tta ate act tgt aat gga 2976 
Glu Met Leu Asp Leu Leu Gin Lys Glu Asn Leu He Thr Cys Asn Gly 
980 985 990 

gtt ttt gga ata ttc cca get aat tct gtt aat gat gat ata gaa ate 3024 
Val Phe Gly He Phe Pro Ala Asn Ser Val Asn Asp Asp He Glu He 
995 1000 1005 

tac act gat aaa gga act gta acc ata aat act ctt cgt cag cag cag 3072 
Tyr Thr Asp Lys Gly Thr Val Thr He Asn Thr Leu Arg Gin Gin Gin 
1010 .1015 1020 

ata ctt aaa gac age gat tat aaa get eta tct gat tat ate get cca 3120 
He Leu Lys Asp Ser Asp Tyr Lys Ala Leu Ser Asp Tyr He Ala Pro 
1025 1030 1035 1040 

aag ggt att ggc ate aaa gat tat ata ggt ggt ttt att gta act get 3168 
Lys Gly He Gly He Lys Asp Tyr He Gly Gly Phe He Val Thr Ala 
1045 1050 1055 
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gga ata ggt gca aag gaa tat tec gat aaa tta aag aaa aaa tgc gac 3216 
Gly He Gly Ala Lys Glu Tyr Ser Asp Lys Leu Lys Lys Lys Cys Asp 
1060 1065 1070 

gat tat gga get act atg ctt aaa ctt ata tgc gat aga ctt gca gag 3264 
Asp Tyr Gly Ala Thr Met Leu Lys Leu He Cys Asp Arg Leu Ala Glu 
1075 1080 1085 

gee ttt tea gaa ctt ctt cac eta agg gta aga aaa gaa tac tgg gga 3312 
Ala Phe Ser Glu Leu Leu His Leu Arg Val Arg Lys Glu Tyr Trp Gly 
1090 1095 iioo 

tac tct caa gat gaa aac tta tec tta gaa aaa ctt ctt aaa gga agt 
Tyr Ser Gin Asp Glu Asn Leu Ser Leu Glu Lys Leu Leu Lys Gly Ser 
1105 mo u 15 



3360 



1120 



tac aga ggg ata aaa cca get att gga tat cct tct att ccc gat cac 3408 
Tyr Arg Gly He Lys Pro Ala He Gly Tyr Pro Ser He Pro Ab P His 
H25 H30 n35 

tct gaa aaa gca aag tta ttt gat tta ctt tta ggt aaa act tea ata 3456 
Ser Glu Lys Ala Lys Leu Phe Asp Leu Leu Leu Gly Lys Thr Ser He 
II 40 1145 1150 

gga gtg gaa ttg acg gaa agt tat atg atg aat cca act tea agt gta 3504 
Gly Val Glu Leu Thr Glu Ser Tyr Met Met Asn Pro Thr Ser Ser Val 
1155 H60 H65 

tgc ggt ttg tat ttt gca aat gaa cga gca aaa tac ttt aat ata aat 3552 
Cye Gly Leu Tyr Phe Ala Asn Glu Arg Ala Lys Tyr Phe Asn He Asn 
1170 1175 neo 

aaa ata gga aaa gat caa ctt gag gac tat get gtt cga agt aat aaa 3600 
Lys He Gly Lys Asp Gin Leu Glu Asp Tyr Ala Val Arg Ser Asn Lys 
1185 H90 1195 1200 

gac att aat gaa ata aaa aaa tta tta gat act ctg tta taa 3642 
Asp He Asn Glu He Lys Lys Leu Leu Asp Thr Leu Leu 
1205 1210 



<210> 38 
<211> 1213 
<212> PRT 

<213> Clostridium acetobutylicum 
<400> 38 

Leu Met Asn Ser Ser Leu Lys Asn Leu Leu Asn Asn Lys He Leu Val 
15 10 is 

Leu Asp Gly Ala Met Gly Thr Cys He Gin Ser Phe Asn Leu Asp Glu 
20 25 30 

Gly Asp Phe Lys Gly Ser Leu Ser Cys Thr Cys His Ser Asn Gin Lys 
35 40 45 

Gly Asn Asn Asp Val Leu Asn Leu Thr Lys Pro Glu He He Lys Glu 
50 55 60 



He His Lys Arg Tyr Leu Glu Ala Gly Ala Asp He He Glu Thr Asn 
65 70 75 no 
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Thr Phe Asn Ala Thr Glu lie Ser Gin Lys Asp Tyr Asn Met Olh Asp 
85 90 95 

Lys lie Tyr Asp He Asn Phe Lys Gly Ala Lys Leu Ala Lys Glu Ala 
100 105 no 

Cys Thr Tyr Tyr Thr Lys Leu Asn Pro Asn Lys Pro Arg Phe Ala Ala 
115 120 125 

Gly Ser He Gly Pro Thr Asn Arg Thr Ala Ser Leu Ser Pro Asp Val 
130 135 140 

Glu Asn Pro Gly Phe Arg Asn Val Thr Phe Asp Glu Leu Cys Asn Ala 
145 150 155 160 

Tyr Lys His Gin He Glu Ala Leu He Asp Gly Gly Val Asp Leu Leu 
165 170 175 

Leu He Glu Thr He Phe Asp Thr Leu Asn Ala Arg Ala Ala He Phe 
180 185 190 

Ala Ala Glu Thr Val Phe Glu Asn Lys Lys He Lys Leu Pro He He 
195 200 205 

He Ser Gly Thr He Ala Asp Lys Ser Gly Arg He Leu Ser Gly Gin 
210 215 220. 

Thr Leu Asp Ala Phe Ala Glu Ser Leu Lys Asn Glu Asn He He Ala 
225 230 235 240 

He Gly Leu Asn Cys Ser Phe Gly Ala Glu Glu Leu He Pro Phe He 
245 250 255 

Lys Arg Leu Ser Glu Thr Gin Asn Arg Tyr He Ser Phe His Pro Asn 
260 265 270 

Ala Gly Leu Pro Asn Ser Leu Gly Glu Tyr Glu Glu Leu Pro Glu Glu 
275 280 285 

Thr Ala Ser He Val Lys Lys Leu Ala Leu Glu Gly His Leu Asn He 
290 295 300 

Val Gly Gly Cys Cys Gly Thr Thr Pro Glu His He Arg Ala He Ser 
305 310 315 " 320 

Ser Val Val Lys Gly He Ser Pro Arg Lys Val Pro Asn Leu Glu Pro 
325 330 335 

Lys Thr He Tyr Ser Gly. Leu Glu Asn lie Lys He Asp Lys Asn Ser 
340 345 350 

Asn Phe He Asn lie Gly Glu Arg Thr Asn Val Ala Gly Ser Arg Lys 
355 360 365 

Phe Ala Arg Leu He Arg Glu Lys Asn Tyr Glu Glu Ala Leu Thr lie 
370 375 380 

Ala Arg His Gin Val Glu Asn Gly Ala Gin lie lie Asp He Asn Phe 
385 390 395 400 

Asp Asp Ala Leu Leu Asp Ala Arg Ser Glu Met Glu Thr Phe Leu Arg 
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405 410 415 

Leu He Ala Ser Glu Pro Glu He Ser Lys Val Pro Val Met He Asp 
420 425 430, 

Ser Ser Asn Phe Glu Val Leu Lys Val Gly Leu Lys Ser He Gin Gly 
435 440 445 

Lys Ala lie Val Asn Ser He Ser Leu Lys Val Gly Glu Glu Lys Phe 
450 455 4 6 o 

He Glu Glu Ala Lys Phe He Lys Asn Phe Gly Ala Gly Val Val Val 
465 4 ?0 475 480 

Met Ala Phe Asp Glu Glu Gly Gin Ala Ala Thr Tyr Glu Arg Lys He 
485 490 495 

Glu He Cys Lys Arg Ala Tyr Thr He Leu Thr Glu Lys Val Glu Phe 
500 505 510 

Pro Pro Glu Asn lie lie Phe Asp Pro Asn He Leu Ser lie Ala Thr 
515 520 525 

Gly lie Glu Glu His Asp Asn Tyr Ala Val Asn Tyr lie Lys Ala Val 
530 535 540 

Lys Trp lie Lys Glu Asn Leu Pro Tyr Ala Lys Val Ser Gly Gly Val 
545 550 555 560 

Ser Asn Leu Ser Phe Ser Phe Arg Gly Asn Asp Ala lie Arg Arg Ala 
565 570 575 

Met His Ser Val Phe Leu Tyr His Ala He Asn Ala Gly Met Asp Met 
580 585 590 

Gly He Val Asn Pro Ala Met lie Asp Leu Tyr Asp Asp He Asp Lys 
535 600 60S 

Asp Leu Leu Glu Lys Val Glu Asn Val Val Leu Asn Lys Ser Ser Asn 
610 615 620 

Ala Ser Glu Ser Leu Leu Glu Phe Ala Gin Thr Tyr Lys Lys Thr Thr 
625 «0 635 6 40 

Glu Thr Leu Glu Lys His Glu Asp Glu Trp Arg Gin Lys Ser Pro Ser 
645 650 655 

Glu Arg Leu Ser Tyr Ala Leu Val Lys Gly Asn Val Glu Phe lie Glu 
660 665 670 

Glu Asp lie Glu Glu Ala Arg Lys Glu Tyr Thr Asn Ala Leu Glu lie 
675 680 685 

lie Glu Val Pro Leu Met Asn Gly Met Lys Lys Val Gly Lys Leu Phe 
690 695 700 

Gly Glu Gly Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala Arg Val 
705 710 715 720 

Met Lys Lys Ala Val Glu Cys Leu Leu Pro Tyr lie Asn Glu Glu Lys 
72 5 730 735 
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Ser Lye Asn His Asn Lys Ser Ala Gly Lye Val Val Phe Ala Thr Val 
740 745 750 

Lys Gly Asp Val His Asp He Gly Lys Asn He Val Ser Val Val Leu 
755 760 765 

Ser Cys Asn Asn Phe Glu Val He Asp Leu Gly Val Met Val Pro Pro 
770 775 780 

Glu Thr He Leu Glu Thr Ala Lys Arg Glu Asn Ala Asp He He Ala 
785 790 795 800 

Leu Ser Gly Leu He Thr Pro Ser Leu Asn Glu Met Ala Tyr Val Ala 
805 810 815 

Glu Glu Met Lys Arg Leu Asn Phe Asp He Pro Leu Met Val Gly Gly 
820 825 830 

Ala Ala Thr Ser Lys Thr His Thr Ala Leu Lys Leu Ala Thr Lys Tyr 
835 840 845 

Lys Tyr Val Val His Ser Thr Asp Ala Ser Asp Ala Val Thr Val Ala 
850 855 860 

Lys Asn Leu Met Ser Glu Asn Lys Phe Thr Phe Leu Glu Lys Leu Asn 
865 870 875 880 

Glu Glu Tyr Ser Lys He Arg Glu Thr Phe Ser Thr Asn Lys He Glu 
885 890 895 

Leu He Ser He Gin Asn Ala Arg Lys Asn Arg Phe Thr He Asp Trp 
900 90S 910 

Asn Lys Thr Lys He Thr Glu Pro Lys Phe Val Gly He Lys Lys Leu 
915 920 925 

Gin Ala Val Pro He Asn Glu Leu Arg Lys Tyr He Asp Trp Thr Phe 
930 935 940 

Phe Phe Thr Ser Trp Asp Met Gly Met Asn Tyr Pro Lys He Met Lys 
945 950 955 960 

Asp Pro Lys Tyr Gly Ala Glu Ala Gin Lys Leu Phe Lys Asp Ala Asn 
965 970 975 

Glu Met Leu Asp Leu Leu Gin Lys Glu Asn Leu He Thr CyB Asn Gly 
980 985 990 

Val Phe Gly He Phe Pro Ala Asn Ser Val Asn Asp Asp He Glu He 
995 1000 1005 

Tyr Thr Asp Lys Gly Thr Val Thr He Asn Thr Leu Arg Gin Gin Gin 
1010 1015 1020 

He Leu Lys Asp Ser Asp Tyr Lys Ala Leu Ser Asp Tyr He Ala Pro 
1025 1030 1035 1040 

Lys Gly He Gly He Lys Asp Tyr He Gly Gly Phe He Val Thr Ala 
1045 1050 1055 

Gly He Gly Ala Lys Glu Tyr Ser Asp Lys Leu Lys Lys Lys Cys Asp 
1060 1065 1070 



WO 03/087386 PCT/EP03/04010 

170 

Asp Tyr Gly Ala Thr Met Leu Lye Leu lie Cys Asp Arg Leu Ala Glu . 
1075 1080 1085 

Ala Phe Ser Glu Leu Leu His Leu Arg Val Arg Lys Glu Tyr Tro Glv 
1090 1095 uio . 

Tyr Ser Gin Asp Glu Asn Leu Ser Leu Glu Lys Leu Leu Lys Gly Ser 
U05 1110 ins H20 

Tyr Arg Gly He Lys Pro Ala He Gly Tyr Pro Ser He Pro Asp His 
1125 mo H35 

Ser Glu Lys Ala Lys. Leu Phe Asp Leu Leu Leu Gly Lys Thr Ser He 
H40 H45 1150 

Gly Val Glu Leu Thr Glu Ser Tyr Met Met Asn Pro Thr Ser Ser Val 
1155 H60 H65 

Cys Gly Leu Tyr Phe Ala Asn Glu Arg Ala Lys Tyr Phe Asn He Asn 
H70 1175 H80 

Lys He Gly Lys Asp Gin Leu Glu Asp Tyr Ala Val Arg Ser Asn LyB 
H fl 5 1190 H95 1200 

Asp He Asn Glu He Lys Lys Leu Leu Asp Thr Leu Leu 
1205 1210 

<210> 39 
<211> 3954 
<212> DNA 

<213> Caulobacter_crescentus 

<220> 
<221> CDS 
<222> (1)..(3951) 
<223> RCO02271 

<400> 39 

atg acc gat etc tec ate cgc gee aac cgc gtc gee gee ctg aag gec 46 
Met Thr Asp Leu Ser He Arg Ala Asn Arg Val Ala Ala Leu Lys Ala 
15 io is 

gec gee aag gag cgt att etc att etc gac ggc tec tgg ggc gtg atg 96 
Ala Ala Lys Glu Arg He Leu He Leu Asp Gly Ser Trp Gly Val Met 
20 25 30 

ttc cag aag aag ggg ctg acc gag gee gac tac cgc gee gag cgc ttc 144 
Phe Gin Lys Lys Gly Leu Thr Glu Ala Asp Tyr Arg Ala Glu Arg Phe 
35 40 45 

gec gec tac aac ggc cag atg aag ggc aat aac gac ate ctg tgc ctg 192 
Ala Ala Tyr Asn Gly Gin Met Lys Gly Asn Asn Asp He Leu Cys Leu 
50 55 eo 

acg egg ccc gat etc gtg gee gag ctg cac gac gee tat ttc age gee 240 
Thr Arg Pro Asp Leu Val Ala Glu Leu His Asp Ala Tyr Phe Ser Ala 
65 70 75 80 

ggc gee gac ate tec gag acc aac acc ttc teg ggc acc acc ate gec 288 
Gly Ala Asp He Ser Glu Thr Asn Thr Phe Ser Gly Thr Thr He Ala 
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85 90 95 

cag gcc gac tat cat ctg ggt gaa cag gat gtc tgg gac ate aac ctg 336 
Gin Ala Asp Tyr His Leu Gly Glu Gin Asp Val Trp Asp lie Asn Leu 
100 105 110 

gaa ggc gcc aag ate ggc cgc teg gtg gcc gac cgc tgg aac gcg cag 3 84 
Glu Gly Ala Lys He Gly Arg Ser Val Ala Asp Arg Trp Asn Ala Gin 
115 120 125 

aat ccc gac cgc ccg aag ttc ate gcc ggc teg atg ggg ccg ctg aac 432 
Asn Pro Asp Arg Pro Lys Phe He Ala Gly Ser Met Gly Pro Leu Asn 
130 135 140 

gtc atg ctg teg atg teg teg gac gtg aac gat ccg ggc gcg cgc aag 480 
Val Met Leu Ser Met Ser Ser Asp Val Asn Asp Pro Gly Ala Arg Lys 
14 * 150 155 160 

gtg acc ttc gac cag gtc tac gag gcc tat cgc cag cag gtg gat gcg 528 
Val Thr Phe Asp Gin Val Tyr Glu Ala Tyr Arg Gin Gin Val Asp Ala 
165 170 175 

ctt tac cag ggc ggg gtc gat etc ttc ctg ate gag acc ate acc gac 576 
Leu Tyr Gin Gly Gly Val Asp Leu Phe Leu He Glu Thr He Thr Asp 
180 185 190 

acc ctg aac tgc aag gcc gcg ate aag gcg ate ctg gac tgg cgc gac 624 
Thr Leu Asn Cys Lys Ala Ala He Lys Ala He Leu Asp Trp Arg Asp 
195 200 205 

gag ggc cac gag gag ctg ccg ate tgg ate age ggc acc ate acc gat 672 
Glu Gly His Glu Glu Leu Pro He Trp He Ser Gly Thr He Thr Asp 
210 215 220 

cgc teg ggc cgc acc ctg teg ggc cag acg gcc gag gcg ttc tgg aac 720 
Arg Ser Gly Arg Thr Leu Ser Gly Gin Thr Ala Glu Ala Phe Trp Asn 
225 230 235 240 

age gtc aag cac gcc aag ccg ttc gca gtg ggc ttc aac tgc gcc ctg 768 
Ser Val Lys His Ala Lys Pro Phe Ala Val Gly Phe Asn Cys Ala Leu 
245 250 255 

ggc gcg gat ttg atg cgt ccg cac ate gcc gag atg gcc cgt ate gcc 816 
Gly Ala Asp Leu Met Arg Pro His He Ala Glu Met Ala Arg He Ala 
260 265 270 

gac acc ctg gtc gca gcc tat ccc aac gcc ggc ctg ccc aac gcc atg 864 
Asp Thr Leu Val Ala Ala Tyr Pro Asn Ala Gly Leu Pro Asn Ala Met 
275 280 285 

ggc cag tac gac gag gag ccg cac gag acc ggc cac gcc ctg cac gag 912 
Gly Gin Tyr Asp Glu Glu Pro His Glu Thr Gly His Ala Leu His Glu 
290 295 300 



tgg gcc aag gac ggc etc gtc aac ate ctg ggc ggc tgc tgc ggc acg 
Trp Ala Lys Asp Gly Leu Val Asn He Leu Gly Gly Cys Cys Gly Thr 
305 310 315 320 



960 



aca ccg gac cac ate cgt cac gtc gcc gac gag gtg cgc ggc gtg acg 1008 
Thr Pro Asp His lie Arg His Val Ala Asp Glu Val Arg Gly Val Thr 
325 330 335 
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ccg cgc cag ate ccc gag cgc ccc aag gec atg cgc ctg gcg ggc etc 1056 
Pro Arg Gin lie Pro Olu Arg Pro Lye Ala Met Arg Leu Ala Gly Leu 
340 345 350 

♦ * 
gaa ccg ttc gag ttg get tag tgg eta egg ccg caa att ccc ttc tec 1104 
Glu Pro Phe Glu Leu Ala Xaa Trp Leu Arg Pro Gin He Pro Phe Ser 
355 360 365 

cct tgc ggg aga agg tgt cgc cga agg cga egg atg agg ggt etc gec 1152 
Pro Cys Gly Arg Arg Cys Arg Arg Arg Arg Arg Met Arg Qly Leu Ala 
370 375 380 

ggc cct tea acc get gtc teg egg egg cga cgt tct tea ace cct cat 1200 
Gly Pro Ser Thr Ala Val Ser Arg Arg Arg Arg Ser Ser Thr Pro His 
385 390 395 400 

ccg acc cgc tgc gcg ggc cac ctt etc ccg caa ggg gag aag gga tga 1248 
Pro Thr Arg Cys Ala Gly His Leu Leu Pro Gin Gly Glu Lys Gly Xaa 
405 410 415 

ctg eta ttg gat cct gaa atg cgc ccc gtc ttc gtc aac ate ggt gag 1296 
Leu Leu Leu Asp Pro Glu Met Arg Pro Val Phe Val Asn He Gly Glu 
420 425 430 

cgc acc aac gtc acc ggc teg gec aag ttc aag aag ctg ate gtc gaa 1344 
Arg Thr Asn Val Thr Gly Ser Ala Lys Phe Lys Lys Leu He Val Glu 
435 440 445 

ggg aac tat ccc gag gcg ctg teg gtc gcg cgc cag cag gtc gag gee 1392 
Gly Asn Tyr Pro Glu Ala Leu Ser Val Ala Arg Gin Gin Val Glu Ala 
450 455 460 

ggg gee cag gtc ate gac gtg aac atg gac gag ggt ctg ctg gac age 1440 
Gly Ala Gin Val He Asp Val Asn Met Asp Glu Gly Leu Leu Asp Ser 
465 470 475 480 

cag cag gee atg gtc acc ttc ctg aat ctg atg gcg gee gag ccc gac 1488 
Gin Gin Ala Met Val Thr Phe Leu Asn Leu Met Ala Ala Glu Pro Asp 
485 490 495 

ate gcg cgc gtg ccg gtg atg ate gac age tec aag tgg gag gtg ate 1536 
He Ala Arg Val Pro Val Met He Asp Ser Ser Lys Trp Glu Val lie 
500 505 510 

gag gcg ggc ctg aag tgc gta caa ggc aag gcg ate gtc aac teg ate 1584 
Glu Ala Gly Leu Lys Cys Val Gin Gly Lys Ala He Val Asn Ser He 
515 520 525 

age ctg aag gaa ggc gag gaa aag ttc etc gaa cag gee acg etc tgc 1632 
Ser Leu Lys Glu Gly Glu Glu Lys Phe Leu Glu Gin Ala Thr Leu Cys 
530 535 540 

ctg cgc tat ggc gca gec gtg gtg gtc atg gee ttc gac gag gtt ggc 1680 
Leu Arg Tyr Gly Ala Ala Val Val Val Met Ala Phe Asp Glu Val Gly 
545 550 555 560 

cag gee gac acc gaa aag cgc aag gtc gag ate tgt acg egg gec tac 1728 
Gin Ala Asp Thr Glu Lys Arg Lys Val Glu He Cys Thr Arg Ala Tyr 
565 570 575 

aac acg etc gtg gac aag gtc ggc ttc ccg ccc gag gac ate ate ttc 1776 
Asn Thr Leu Val Asp Lys Val Gly Phe Pro Pro Glu Asp He He Phe 
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580 585 590 

gac ccc aac ate ttc gec gtg gcg acg ggg ate gag gag cac gac aac 1824 
Asp Pro Asn He Phe Ala Val Ala Thr Gly He Glu Glu His Asp Asn 
595 600 605 

tac gec gtc gac ttc ate gag gec acg egg cgc ate aag cag atg ttg 1872 
Tyr Ala Val Asp Phe He Glu Ala Thr Arg Arg He Lys Gin Met Leu 
610 615 620 

ccc tat gcg egg gtg teg ggc ggg gtg teg aac gtc teg ttc age ttc 1920 
Pro Tyr Ala Arg Val Ser Gly Gly Val Ser Asn Val Ser Phe Ser Phe 
625 630 635 640 

egg ggc aat gag ccg gtg cgc egg gcg ate cac teg gtg ttc ctg tac 1968 
Arg Gly Asn Glu Pro Val Arg Arg Ala He His Ser Val Phe Leu Tyr 
645 650 655 

cac gee ate aac gee ggc atg gac atg ggc ate gtc aac gec ggc gac 2016 
His Ala He Asn Ala Gly Met Asp Met Gly He Val Asn Ala Gly Asp 
660 665 670 

ctg ccg gtc tat gac gac ate gat ccg gee ctg cgc gag gee gtc gag 2064 
Leu Pro Val Tyr Asp Asp He Asp Pro Ala Leu Arg Glu Ala Val Glu 
675 680 685 

gac gtg ate etc aac egg ccg cag cgc gat ccg gtg atg ace aac acc 2112 
Asp Val He Leu Asn Arg Pro Gin Arg Asp Pro Val Met Thr Asn Thr 
690 695 700 

gag cgc ctg gtc gag atg gec ccg cgc tat aag ggc gag aag ggg cag 2160 
Glu Arg Leu Val Glu Met Ala Pro Arg Tyr Lys Gly Glu Lys Gly Gin 
705 710 715 720 

cag cag gtc gee aac ctg gag tgg cga aag ggc acg gtg aac gag cgc 2208 
Gin Gin Val Ala Asn Leu Glu Trp Arg Lys Gly Thr Val Asn Glu Arg 
725 730 735 

ctg acc cat get etc gtt cac ggc ate acc gag ttc ate gag cag gac 2256 
Leu Thr His Ala Leu Val His Gly He Thr Glu Phe He Glu Gin Asp 
740 745 750 

acc gag gag gcg cgc ctg gee gee gag cgc ccc ttg cac gtg att gaa 2304 
Thr Glu Glu Ala Arg Leu Ala Ala Glu Arg Pro Leu His Val He Glu 
755 760 765 

ggc ccg ctg atg gac ggc atg aac gtc gtc ggc gac ctg ttc ggc gcg 2352 
Gly Pro Leu Met Asp Gly Met Asn Val Val Gly Asp Leu Phe Gly Ala 
770 775 780 

ggc aag atg ttc ctg ccc cag gtg gtg aag teg gec cgc gtg atg aag 2400 
Gly Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala Arg Val Met Lys 
785 790 795 800 

cag gee gtc gee tgg ctg atg ccg ttc atg gag gee gag aag gaa ggc 2448 
Gin Ala Val Ala Trp Leu Met Pro Phe Met Glu Ala Glu Lys Glu Gly 
805 610 81S 

cag gag cgc aag gec gee ggc aag gtg ctg atg gee acc gtc aag ggc 24 96 
Gin Glu Arg Lys Ala Ala Gly Lys Val Leu Met Ala Thr Val Lys Gly 
820 825 830 
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gac gtc cac gac ate ggt aag aac ate gtc ggc gtc gtg ctg cag tgt 
Asp Val His Asp lie Gly Lys Asn lie Val Gly Val Val Leu Gin Cye 
835 840 845 



2544 



aac aac tac gag gtc gtg gac ctg ggt gtc atg gtg ccc gec gac cgc 2592 
Asn Asn Tyr Glu Val Val Asp Leu Gly Val Met Val Pro Ala Asp Arg 
850 855 860 

ate ctg gac gaa gec aag aag cac aag gtc gac atg ate ggc ctg teg 2640 
lie Leu Asp Glu Ala Lys Lys His Lys Val Asp Met lie Gly Leu Ser 
865 870 875 880 



ggc ctg ate ace ccc teg ctg gac gag atg gtg ttc gtg gec gec gag 
Gly Leu He Thr Pro Ser Leu Asp Glu Met Val Phe Val Ala Ala Glu 
885 890 895 



2688 



atg gag cgc cag ggc ttt gat ate ccg ctg ctg ate ggc ggc gee ace 
Met Glu Arg Gin Gly Phe Asp He Pro Leu Leu He Gly Gly Ala Thr 
900 905 910 



2736 



acc age cgc ace cac ace gcg gtg aag ate gag ccg gee tat cgc egg 
Thr Ser Arg Thr His Thr Ala Val Lys He Glu Pro Ala Tyr Arg Arg 
915 920 925 



2784 



ggt ccg acg acc tat gtc gtc gac gec age cgc gec gtg ggc gtg gtc 
Gly Pro Thr Thr Tyr Val Val Asp Ala Ser Arg Ala Val Gly Val Val 
930 935 940 



2832 



teg ggc ctg ctg teg gaa ggc gag cgt gac egg ate ate gee gag acc 
Ser Gly Leu Leu Ser Glu Gly Glu Arg Asp Arg He He Ala Glu Thr 
945 950 955 960 



2880 



cgc gee gag tat gtg aag gtc cgc gag caa tac gcg cgc ggc cag acc 
Arg Ala Glu Tyr Val Lys Val Arg Glu Gin Tyr Ala Arg Gly Gin Thr 
965 970 975 



2928 



acc aag gee cgc gee teg ate cag gag gee cgc aag cgc gee ttc gee 
Thr Lys Ala Arg Ala Ser He Gin Glu Ala Arg Lys Arg Ala Phe Ala 
980 985 990 



2976 



att gac tgg aag ggc tat gcg ccg ccc aag ccc gec ttc ate ggc acg 
He Asp Trp Lys Gly Tyr Ala Pro Pro Lys Pro Ala Phe He Gly Thr 
995 1000 1005 



3024 



egg gtg ttc gag ccg teg ctg gee gag ctg gtc ccg ttc ate gac tgg 
Arg Val Phe Glu Pro Ser Leu Ala Glu Leu Val Pro Phe He Asp Trp 
1010 1015 1020 



3072 



teg ccg ttc ttc gee age tgg gag ctg ate ggc cgc ttc ccg cag ate 
Ser Pro Phe Phe Ala Ser Trp Glu Leu He Gly Arg Phe Pro Gin He 
1025 1030 1035 1040 



3120 



ctg gag gac gac gtg gtc ggc cag gee gec acc gac etc tac cgc gac 
Leu Glu Asp Asp Val Val Gly Gin Ala Ala Thr Asp Leu Tyr Arg Asp 
1045 1050 1055 



3166 



gee cgc gee atg ctg gac aag gtg gtc gag gaa aag tgg ttc ggg gee 
Ala Arg Ala Met Leu Asp Lys Val Val Glu Glu Lys Trp Phe Gly Ala 
1060 1065 1070 



3216 



aag ggc gtg ate ggc ttc tgg 
Lys Gly Val He Gly Phe Trp 



ccg gec cag gee cag 
Pro Ala Gin Ala Gin 



ggc gac gac ate 3264 
Gly Asp Asp He 
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ioeo 



1085 



gtg etc tat acc gac gag acc cgc gtg gec gag ttc teg cgc ctg cac 
Val Leu Tyr Thr Asp Glu Thr Arg Val Ala Glu Phe Ser Arg Leu His 
1090 1095 1100 



3312 



aec ctt cge cag cag atg gac aag ggc gec gac aag age ggc gag gec 
Thr Leu Arg Gin Gin Met Asp Lys Gly Ala Asp Lys Ser Gly Glu Ala 
1105 1110 1115 1120 



3360 



aag gec aat gtc gee ctg teg gac ttc gtc gcg ccg ate ggg cag ggg 
Lys Ala Asn Val Ala Leu Ser Asp Phe Val Ala Pro He Gly Gin Gly 
1125 1130 1135 



3408 



get gac tat gtc ggc ggc ttc gee gtc acc gca ggc cat ggc gag gac 
Ala Asp Tyr Val Gly Gly Phe Ala Val Thr Ala Gly His Gly Glu Asp 
1140 1145 1150 



3456 



gag ate gtc gec aag ttc aag gcg gee ggc gac gac tac aac gec ate 
Glu He Val Ala Lys Phe Lys Ala Ala Gly Asp Asp Tyr Asn Ala He 
1155 1160 1165 



3504 



atg gee teg gec ctg gee gac cgc ctg gee gaa gec ttc gee gag tgg 
Met Ala Ser Ala Leu Ala Abp Arg Leu Ala Glu Ala Phe Ala Glu Trp 
1170 1175 1180 



3552 



ctg cac tac aaa gec cgt gtc gag ctg tgg ggc tac gee gec gac gag 
Leu His Tyr Lys Ala Arg Val Glu Leu Trp Gly Tyr Ala Ala Asp Glu 
1185 1190 1195 1200 



3600 



gac gee gac gtc gag cgc ctg ate gec gaa aag tac cag ggc ate cgc 
Asp Ala Asp Val Glu Arg Leu He Ala Glu Lys Tyr Gin Gly He Arg 
1205 1210 1215 



3648 



ccc gcg ccc ggc tat ccg gee cag ccc gac cac acc gag aaa ggt acg 
Pro Ala Pro Gly Tyr Pro Ala Gin Pro Asp His Thr Glu Lys Gly Thr 
1220 1225 1230 



3696 



ctg ttc aag ctg etc gac gee gag gcg gee acc ggt ctg cag ctg acc 
Leu Phe Lys Leu Leu Asp Ala Glu Ala Ala Thr Gly Leu Gin Leu Thr 
1235 1240 1245 



3744 



gag age tac gee atg acc cct ggc gcg gcg gtc tec ggc ctg ttc ttc 
Glu Ser Tyr Ala Met Thr Pro Gly Ala Ala Val Ser Gly Leu Phe Phe 
1250 1255 1260 



3792 



age cac cgc cag gcg cac tat ttc ggg gtc ggc aag ate gac gee gac 
Ser His Arg Gin Ala His Tyr Phe Gly Val Gly Lys He Asp Ala Asp 
1265 1270 1275 1280 



3840 



cag gtc gag gac tac gec cgc cgc aag ggc tgg gat atg gag acg gee 3888 
Gin Val Glu Asp Tyr Ala Arg Arg Lys Gly Trp Asp Met Glu Thr Ala 
1285 1290 1295 



gag cgc tgg ctg teg ccg ate ctg aac tac gat ccg eta gcg egg gcg 
Glu Arg Trp Leu Ser Pro He Leu Asn Tyr Asp Pro Leu Ala Arg Ala 
1300 1305 1310 



3936 



cgc ggg gcg gcg get tag 
Arg Gly Ala Ala Ala 
1315 



3954 
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<210> 40 
<211> 1317 
<212> PRT 

<213> Caulobacter_crescentuB 
<220> 

<221> unsure 
<222> 359 . . 359 

<223> All occurrences of Xaa indicate any amino acid 
<220> 

<221> unsure 
<222> 416 . . 416 

<223> All occurrences of Xaa indicate any amino acid 
<400> 40 

Met Thr Asp Leu Ser He Arg Ala Asn Arg Val Ala Ala Leu Lys Ala 
15 10 15 

Ala Ala Lys Glu Arg He Leu He Leu Asp Gly Ser Trp Gly Val Met 
20 25 30 

Phe Gin Lys Lys Gly Leu Thr Glu Ala Asp Tyr Arg Ala Glu Arg Phe 
35 40 45 

Ala Ala Tyr Asn Gly Gin Met Lys Gly Asn Asn Asp lie Leu Cyo Leu 
50 55 60 

Thr Arg Pro Asp Leu Val Ala Glu Leu His Asp Ala Tyr Phe Ser Ala 
65 70 75 80 

Gly Ala Asp He Ser Glu Thr Asn Thr Phe Ser Gly Thr Thr He Ala 
85 90 95 

Gin Ala Asp Tyr His Leu Gly Glu Gin Abp Val Trp Asp He Asn Leu 
100 105 110 

Glu Gly Ala Lys He Gly Arg Ser Val Ala Asp Arg Trp Asn Ala Gin 
115 120 125 

Asn Pro Asp Arg Pro Lys Phe He Ala Gly Ser Met Gly Pro Leu Asn 
130 135 140 

Val Met Leu Ser Met Ser Ser Asp Val Asn Asp Pro Gly Ala Arg Lys 
145 150 155 160 

Val Thr Phe Asp Gin Val Tyr Glu Ala Tyr Arg Gin Gin Val Asp Ala 
165 170 175 

Leu Tyr Gin Gly Gly Val Asp Leu Phe Leu He Glu Thr He Thr Asp 
180 185 190 

Thr Leu Asn Cys Lys Ala Ala He Lys Ala He Leu Asp Trp Arg Asp 
195 200 205 

Glu Gly His Glu Glu Leu Pro He Trp He Ser Gly Thr He Thr Asp 
210 215 220 

Arg Ser Gly Arg Thr Leu Ser Gly Gin Thr Ala Glu Ala Phe Trp Asn 
225 230 235 240 
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Ser Val Lys His Ala Lys Pro Phe Ala Val Gly Phe Asn Cya Ala Leu 
245 250 255 

Gly Ala Asp Leu Met Arg Pro Hie lie Ala Glu Met Ala Arg lie Ala 
260 265 270 

Asp Thr Leu Val Ala Ala Tyr Pro Asn Ala Gly Leu Pro Asn Ala Met 
275 2B0 285 

Gly Gin Tyr Asp Glu Glu Pro His Glu Thr Gly His Ala Leu His Glu 
290 295 300 

Trp Ala Lys Asp Gly Leu Val Asn lie Leu Gly Gly Cys Cys Gly Thr 
305 310 315 320 

Thr Pro Asp His He Arg His Val Ala Asp Glu Val Arg Gly Val Thr 
325 330 335 

Pro Arg Gin He Pro Glu Arg Pro Lys Ala Met Arg Leu Ala Gly Leu 
340 345 350 

Glu Pro Phe Glu Leu Ala Xaa Trp Leu Arg Pro Gin He Pro Phe Ser 
355 360 365 

Pro Cys Gly Arg Arg Cys Arg Arg Arg Arg Arg Met Arg Gly Leu Ala 
370 375 380 

Gly Pro Ser Thr Ala Val Ser Arg Arg Arg Arg Ser Ser Thr Pro His 
385 390 395 400 

Pro Thr Arg Cys Ala Gly His Leu Leu Pro Gin Gly Glu Lys Gly Xaa 
405 410 415 

Leu Leu Leu Asp Pro Glu Met Arg Pro Val Phe Val Asn He Gly Glu 
420 425 430 

Arg Thr Asn Val Thr Gly Ser Ala Lys Phe Lys Lys Leu He Val Glu 
435 440 445 

Gly Asn Tyr Pro Glu Ala Leu Ser Val Ala Arg Gin Gin Val Glu Ala 
450 455 460 

Gly Ala Gin Val He Asp Val Asn Met Asp Glu Gly Leu Leu Asp Ser 
465 470 475 460 

Gin Gin Ala Met Val Thr Phe Leu Asn Leu Met Ala Ala Glu Pro Asp 
485 490 495 

He Ala Arg Val Pro Val Met He Asp Ser Ser Lys Trp Glu Val He 
500 505 510 

Glu Ala Gly Leu Lys Cys Val Gin Gly Lys Ala He Val Asn Ser He 
515 520 525 

Ser Leu Lys Glu Gly Glu Glu Lys Phe Leu Glu Gin Ala Thr Leu Cys 
530 535 540 

Leu Arg Tyr Gly Ala Ala Val Val Val Met Ala Phe Asp Glu Val Gly 
545 550 555 560 

Gin Ala Asp Thr Glu Lys Arg Lys Val Glu He Cys Thr Arg Ala Tyr 
565 570 575 
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Asn Thr Leu Val Asp LyB Val Gly phe Pro Pro Glu Asp He He Phe 
580 585 * 590 

Asp* Pro Asn He Phe Ala Val Ala Thr Gly He Glu Glu His Asp Asn 
595 600 605 

Tyr Ala Val Asp Phe He Glu Ala Thr Arg Arg lie Lys Gin Met Leu 
610 615 620 

Pro Tyr Ala Arg Val Ser Gly Gly Val Ser Asn Val Ser Phe Ser Phe 
625 630 635 640 

Arg Gly Asn Glu Pro Val Arg Arg Ala He H1b Ser Val Phe Leu Tyr 
645 650 655 

His Ala He Asn Ala Gly Met Asp Met Gly He Val Asn Ala Gly Asp 
660 665 670 

Leu Pro Val Tyr Asp Asp He Asp Pro Ala Leu Arg Glu Ala Val Glu 
675 680 685 

Asp Val He Leu Asn Arg Pro Gin Arg Asp Pro Val Met Thr Asn Thr 
690 695 700 

Glu Arg Leu Val Glu Met Ala Pro Arg Tyr Lys Gly Glu Lys Gly Gin 
70S 710 715 720 

Gin Gin Val Ala Asn Leu Glu Trp Arg Lys Gly Thr Val Asii Glu Arg 
725 730 735 

Leu Thr His Ala Leu Val His Gly He Thr Glu Phe He Glu Gin Asp 
740 745 750 

Thr Glu Glu Ala Arg Leu Ala Ala Glu Arg Pro Leu His Val He Glu 
755 760 765 

Gly Pro Leu Met Asp Gly Met Asn Val Val Gly Asp Leu Phe Gly Ala 
770 775 780 

Gly Lys Met Phe Leu Pro Gin Val Val LyB Ser Ala Arg Val Met Lys 
785 790 795 800 

Gin Ala Val Ala Trp Leu Met Pro Phe Met Glu Ala Glu Lys Glu Gly 
805 810 815 

Gin Glu Arg Lys Ala Ala Gly Lys Val Leu Met Ala Thr Val Lys Gly 
820 825 830 

Asp Val His Asp He Gly Lys Asn He Val Gly Val Val Leu Gin Cys 
835 840 845 

Asn Asn Tyr Glu Val Val Asp Leu Gly Val Met Val Pro Ala Asp Arg 
850 855 * 860 

lie Leu Asp Glu Ala Lys Lys His Lys Val Asp Met He Gly Leu Ser 
865 870 875 880 

Gly Leu He Thr Pro Ser Leu Asp Glu Met Val Phe Val Ala Ala Glu 
885 890 695 

Met Glu Arg Gin Gly Phe Asp He Pro Leu Leu He Gly Gly Ala Thr 
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900 905 910 

Thr Ser Arg Thr His Thr Ala Val Lye He Glu Pro Ala Tyr Arg Arg 
915 920 925 

Gly Pro Thr Thr Tyr Val Val Asp Ala Ser Arg Ala Val Gly Val Val 
930 935 940 

Ser Gly Leu Leu Ser Glu Gly Glu Arg Asp Arg He He Ala Glu Thr 
945 950 955 960 

Arg Ala Glu Tyr Val Lye Val Arg Glu Gin Tyr Ala Arg Gly Gin Thr 
965 970 975 

Thr Lys Ala Arg Ala Ser He Gin Glu Ala Arg Lys Arg Ala Phe Ala 
980 985 990 

He Asp Trp Lys Gly Tyr Ala Pro Pro Lys Pro Ala Phe He Gly Thr 
995 1000 1005 

Arg Val Phe Glu Pro Ser Leu Ala Glu Leu Val Pro Phe He Asp Trp 
1010 1015 1020 

Ser Pro Phe Phe Ala Ser Trp Glu Leu lie Gly Arg Phe Pro Gin He 
1025 1030 1035 1040 

Leu Glu Asp Asp Val Val Gly Gin Ala Ala Thr Asp Leu Tyr Arg Asp 
1045 1050 * 1055 

Ala Arg Ala Met Leu Asp Lys Val Val Glu Glu Lys Trp Phe Gly Ala 
1060 1065 " 1070 

Lys Gly Val He Gly Phe Trp Pro Ala Gin Ala Gin Gly Asp Asp He 
1075 1080 1085 

Val Leu Tyr Thr Asp Glu Thr Arg Val Ala Glu Phe Ser Arg Leu His 
1090 1095 1100 

Thr Leu Arg Gin Gin Met Asp Lys Gly Ala Asp Lys Ser Gly Glu Ala 
H05 1110 ins 1120 

Lys Ala Asn Val Ala Leu Ser Asp Phe Val Ala Pro He Gly Gin Gly 
1125 1130 H35 

Ala Asp Tyr Val Gly Gly Phe Ala Val Thr Ala Gly His Gly Glu Asp 
1140 1145 1150 

Glu He Val Ala Lys Phe Lys Ala Ala Gly Asp Asp Tyr Asn Ala He 
1155 1160 1165 

Met Ala Ser Ala Leu Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu Trp 
1170 1175 1180 

Leu His Tyr Lys Ala Arg Val Glu Leu Trp Gly Tyr Ala Ala Asp Glu 
1185 1190 1195 1200 

Asp Ala Asp Val Glu Arg Leu He Ala Glu Lys Tyr Gin Gly He Arg 
1205 1210 1215 

Pro Ala Pro Gly Tyr Pro Ala Gin Pro Asp His Thr Glu Lys Gly Thr 
1220 1225 1230 
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Leu Phe Lys Leu Leu Asp Ala Glu Ala Ala Thr Gly Leu Gin Leu Thr 
1235 1240 1245 

Glu Ser Tyr Ala Met Thr Pro Gly Ala Ala Val Ser Gly Leu. Phe Phe 
1250 1255 1260 

Ser His Arg Gin Ala His Tyr Phe Gly Val Gly Lys lie Asp Ala Asp 
1265 1270 1275 1280 

Gin Val Glu Asp Tyr Ala Arg Arg Lys. Gly Trp Asp Met Glu Thr Ala 
1285 1290 1295 

Glu Arg Trp Leu Ser Pro He Leu Asn Tyr Asp Pro Leu Ala Arg Ala 
1300 1305 1310 

Arg Gly Ala Ala Ala 
1315 



<210> 41 
<211> 3759 
<212> DNA 

<213> Rhodobacter capsulatus 

<220> 
<221> CDS 
<222> (1) (3756) 
<223> RRC01731 



<400> 41 

atg ctg acc cag acc ctg ccc cga tct gcg gqc ttt gcc gca att gag 48 

Met Leu Thr Gin Thr Leu Pro Arg Ser Ala Ala Phe Ala Ala He Glu 
15 10 is 

gcg ctt teg cgc cag egg ate ttg ate ctt gac ggg gcg atg ggc acg 96 
Ala Leu Ser Arg Gin Arg He Leu He Leu Asp Gly Ala Met Gly Thr 
20 25 30 

cag ate cag cag ctt ggc ctg age gag gac gat ttt ctg ggc cac ggc 144 
Gin He Gin Gin Leu Gly Leu Ser Glu Asp Asp Phe Leu Gly His Gly 
35 40 45 

teg ggc tgc gcc tgc cgc cat gcc acc gat cat ccg caa aag ggc aac 192 
Ser Gly Cys Ala Cys Arg His Ala Thr Asp His Pro Gin Lys Gly Asn 
50 55 60 

aac gac ctg ctg gtg ctg acc cag ccg caa gcg ate gag gag ate cat 240 
Asn Asp Leu Leu Val Leu Thr Gin Pro Gin Ala He Glu Glu He His 
65 70 75 80 

ttc cgc tat gcg atg gcg ggg gcg gat ate gtc gag acg aac acc ttt 288 
Phe Arg Tyr Ala Met Ala Gly Ala Asp He Val Glu Thr Asn Thr Phe 
85 90 95 

teg gcc acc acc ate gcg cag gcc gat tac ggg ctg gaa age gcg gtg 336 
Ser Ala Thr Thr He Ala Gin Ala Asp Tyr Gly Leu Glu Ser Ala Val 
100 105 no 

ttc gac ctg aac gcc gcg ggg gcg egg gtg gcg egg gcg gcg atg gac 384 
Phe Asp Leu Asn Ala Ala Gly Ala Arg Val Ala Arg Ala Ala Met Asp 
115 120 125 
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cgc gcc gag gcc acc gac gga egg cgc cgc ttc gtt gcg ggg gcg gtg 432 
Arg Ala Glu Ala Thr Asp Gly Arg Arg Arg Phe Val Ala Gly Ala Val 
130 135 140 

ggg ccg acg aac cgc acc gcc teg etc teg ccc gat gtg aac gac ccg 460 
Gly Pro Thr Asn Arg Thr Ala Ser Leu Ser Pro Asp Val Asn Asp Pro 
145 150 155 160 

ggc ttt cgc gcc gtc acc ttc gac gat ctg cgc acg gcc tat ggc cag 528 
Gly Phe Arg Ala Val Thr Phe Asp Asp Leu Arg Thr Ala Tyr Gly Gin 
165 170 175 

cag gtg cgc ggt ctg ate gcg ggg ggc gcc gat ate ctg ctg ate gag 576 
Gin Val Arg Gly Leu lie Ala Gly Gly Ala Asp He Leu Leu He Glu 
180 1B5 190 

acg ate ttt gac acg ctg aac gcc aag gcg gcg att ttc gcc tgt ttc 624 
Thr He Phe Asp Thr Leu Asn Ala Lys Ala Ala He Phe Ala Cys Phe 
195 200 205 

gaa gcc ttt gcc gaa egg ggc gag egg ctg ccg gtg atg att tec ggc 672 
Glu Ala Phe Ala Glu Arg Gly Glu Arg Leu Pro Val Met He Ser Gly 
210 215 220 

acg ate acc gat gcc teg ggg cgc aca ttg teg ggg cag acg ccg acc 720 
Thr He Thr Asp Ala Ser Gly Arg Thr Leu Ser Gly Gin Thr Pro Thr 
225 230 235 240 

gcg ttc tgg cat teg gtg get cat gcc egg ccc ttt acc gtg ggg ctg 768 
Ala Phe Trp His Ser Val Ala H1b Ala Arg Pro Phe Thr Val Gly Leu 
245 250 255 

aac tgc gcg ctg ggc gcc agt gcg atg cgt ccg cat ctg gcg gaa ctg 816 
Asn Cys Ala Leu Gly Ala Ser Ala Met Arg Pro His Leu Ala Glu Leu 
260 265 270 

gcg ggc gtc gcc ccc tgc gcg ate tgc gcc tat ccc aat gcc ggg ctg 664 
Ala Gly Val Ala Pro Cys Ala He CyB Ala Tyr Pro Asn Ala Gly Leu 
275 280 285 

ccc aat gcc ttt ggc caa tat gac gaa acc ccc gac egg acc gcc gcg 912 
Pro Asn Ala Phe Gly Gin Tyr Asp Glu Thr Pro Asp Arg Thr Ala Ala 
290 295 300 

cag gtg gcc gaa ttt gcc cgc gaa ggg ctg gtc aat gtc gtg ggc ggt 960 
Gin Val Ala Glu Phe Ala Arg Glu Gly Leu Val Asn Val Val Gly Gly 
305 310 315 320 

tgc tgc ggc acc acc ccc gat cac ate cgc gcc ate gcg gaa gcc gtg 1008 
Cys Cys Gly Thr Thr Pro Asp His He Arg Ala He Ala Glu Ala Val 
325 330 335 

aaa cct ttc ccg ccg agg gcc ctg cca age cgt tat ctg cgc ctt teg 1056 
Lys Pro Phe Pro Pro Arg Ala Leu Pro Ser Arg Tyr Leu Arg Leu Ser 
340 345 350 

ggg ctt gag ccc ttt acc ctg acg ccc gac att ccc ttc gtg aac ate 1104 
Gly Leu Glu Pro Phe Thr Leu Thr Pro Asp He Pro Phe Val Asn He 
355 360 * 365 

ggc gag cgc acg aat gtc acc ggc teg gcc egg ttc cgc aag atg ate 1152 
Gly Glu Arg Thr Asn Val Thr Gly Ser Ala Arg Phe Arg Lys Met He 
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gtc gcc 
Val Ala 
385 


cgc 
Arg 


gac tat gcc 
Asp Tyr Ala 
390 


gcc 
Ala 


gcg 
Ala 


ctg 
Leu 


gat 
Asp 


gtc gcc 
Val Ala 
395 


cgc 
Arg 


gat cag 
Asp ,Gln 


gtg 

Val 
400 


1200 


gaa aac 
Glu Asn 


ggc 
Gly 


gcg cag ate 
Ala Gin He 
405 


ctt 
Leu 


gac 
Asp 


ate 
He 


aac 
Asn 
410 


atg gac 
Met Asp 


gag 
Glu 


ggg 

Gly 


ctg 
Leu 
415 


ate 
He 


1248 


gac agt 
Asp Ser 


cag 
Gin 


gcg gcg atg 
Ala Ala Met 

420 


gtc 
Val 


gcc 
Ala 


ttc 
Phe 
425 


etc 
Leu 


aac etc 
Asn Leu 


ttg 
Leu 


gcc 
Ala 
430 


gcc 
Ala 


gag 
Glu 


1296 


ccc gac 
Pro Asp 


att 
He 
435 


gcc egg gtg 
Ala Arg Val 


ccg 
Pro 


gtg 
Val 
440 


atg 
Met 


ate 
He 


gac age 
Asp Ser 


teg 
Ser 
445 


aaa 
Lys 


tgg 
Trp 


gag 
Glu 


1344 


gtg ate 
Val He 
450 


gag 
Glu 


gcc ggg ctg 
Ala Gly Leu 


aaa 
Lys 
455 


tgc 
Cys 


gtg 
val 


cag 
Gin 


ggc aag 
Gly Lys 
460 


ccc 
Pro 


gtc 
Val 


gtc 
Val 


aat 
Asn 


1392 


teg ate 
Ser He 
465 


age 
Ser 


ctg aag gag 
Leu Lys Glu 
470 


ggc 
Gly 


gag 
Glu 


gag 
Glu 


ate 
He 


ttc cgc 
Phe Arg 
475 


cat 
His 


cac 
His 


gcg 
Ala 


gcg 
Ala 

480 


1440 


ctg tgt 
Leu Cys 


ctg 
Leu 


gcc tat ggc 
Ala Tyr Gly 
485 


gcg 
Ala 


gcg 
Ala 


gtc 
Val 


gtc 
Val 
490 


gtg atg 
Val Met 


gcc 
Ala 


ttt 
Phe 


gac 
Asp 
495 


gaa 
Glu 


1488 


gag ggg 
Glu Gly 


cag 
Gin 


gcc gac agt 
Ala Asp Ser 
500 


ttc 
Phe 


gcc 
Ala 


cga 
Arg 
505 


aag 
Lys 


acc age 
Thr Ser 


ate 
He 


tgc 
Cys 
510 


gcc 
Ala 


cgc 
Arg 


1536 


gcc tat 
Ala Tyr 


cgc 
Arg 
515 


att ctg gtc 
He Leu Val 


gag 
Glu 


gag 
Glu 
520 


ate 
He 


ggc 

Gly 


ttt ccg 
Phe Pro 


ccc 
Pro 
525 


gaa 
Glu 


gac 
Asp 


ate 
He 


1584 


ate ttt 
He Phe 
530 


gac 
Asp 


ccg aac gtc 
Pro Asn Val 


ttt 
Phe 
535 


gcc 
Ala 


gtc 
Val 


gcc 
Ala 


acg ggc 
Thr Gly 
540 


ate 
He 


gaa 
Glu 


gaa 
Glu 


cac 
His 


1632 


gac aat 
Asp Asn 
545 


tac 
Tyr 


ggc gtt gat 
Gly Val Asp 
550 


ttc 
Phe 


ate 
He 


gag 
Glu 


gcc 
Ala 


get cgc 
Ala Arg 
555 


tgg 
Trp 


ate 
He 


egg 
Arg 


gcc 
Ala 
560 


1680 


aac ctg 
Asn Leu 


ccg 
Pro 


cat gcc cat 
His Ala His 
565 


gtc 
Val 


teg 
Ser 


ggc 
Gly 


ggg 

Gly 
570 


gtg teg 
Val Ser 


aac 
Asn 


ctg 
Leu 


tec 
Ser 
575 


ttc 
Phe 


1728 


age ttt 
Ser Phe 


cgc 
Arg 


ggc aac gaa 
Gly Asn Glu 
580 


ccc 
Pro 


gtg 
Val 


cgc 
Arg 
585 


gcg 
Ala 


gcg atg 
Ala Met 


cat 
Hie 


gcg 
Ala 
590 


gtg 
Val 


ttt 
Phe 


1776 


ctt tac 
Leu Tyr 


cac 
His 
595 


gcc ate cgc 
Ala He Arg 


gcc 
Ala 


ggg 

Gly 
600 


atg 
Met 


gat 
Asp 


atg ggg 
Met Gly 


ate 
He 
605 


gtc 
val 


aat 
Asn 


gcc 
Ala 


1824 



ggg cag ctg gtg gtc tat gac cag ate gac ccc gag ctg cgc cag gcc 
Gly Gin Leu Val Val Tyr Asp Gin He Asp Pro Glu Leu Arg Gin Ala 
610 615 620 



1872 
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tgc gag gat gtg gtg etc aac cgc cag ccc aaa teg ggc ggc ace gcg 1920 
Cys Glu Asp Val Val Leu Asn Arg Gin Pro Lys Ser Gly Gly Thr Ala 
625 630 635 640 

acc gag egg atg ctg gag gtg gec gag cgc ttc cgc ggc ggc gcg cgc 1968 
Thr Glu Arg Met Leu Glu Val Ala Glu Arg Phe Arg Gly Gly Ala Arg 
645 650 655 

gag gaa aag acc cgc gat ctg gec tgg cgc gac tgg ccg gtg gaa aag 2016 
Glu Glu Lys Thr Arg Asp Leu Ala Trp Arg Asp Trp Pro Val Glu Lys 
660 665 670 

egg etc gaa cat gcg ctg gtc aat ggc ate acc gaa ttc ate gag gec 2064 
Arg Leu Glu His Ala Leu Val Asn Gly He Thr Glu Phe He Glu Ala 
675 660 685 

gat acc gaa gee gca agg ctt ctg gee gaa cgc ccg ctg cat gtg ate 2112 
Asp Thr Glu Ala Ala Arg Leu Leu Ala Glu Arg Pro Leu His Val He 
690 695 700 

gaa ggg ccg ctg atg gcg ggg atg aat gtc gtc ggt gat ctg ttc ggc 2160 
Glu Gly Pro Leu Met Ala Gly Met Asn Val Val Gly Asp Leu Phe Gly 
705 710 715 720 

gcg ggc aag atg ttc ctg cca cag gtg gtg aaa teg gcg cgc gtg atg 2208 
Ala Gly Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala Arg Val Met 
725 730 735 

aaa cag gee gtc gec gtt ctg ctg ccc tac atg gat gec gaa aag gec 2256 
Lys Gin Ala Val Ala Val Leu Leu Pro Tyr Met Asp Ala Glu Lys Ala 
740 745 750 

gcg cgc ggc ggc gag ggg cgc gaa acc gcg ggc aag ate ctg atg gee 2304 
Ala Arg Gly Gly Glu Gly Arg Glu Thr Ala Gly Lys He Leu Met Ala 
755 760 765 

acg gtc aag ggc gat gtg cat gac ate ggc aag aac ate gtc ggc gtc 2352 
Thr Val Lys Gly Asp Val H1b Asp He Gly Lys Asn He Val Gly Val 
770 775 780 

gtg ctg gee tgc aac aat tac gac ate gtc gac ctg ggc gtg atg gtg 2400 
Val Leu Ala Cys Asn Asn Tyr Asp He Val Asp Leu Gly Val Met Val 
785 790 795 800 

ccg ccg caa aag ate ctg gaa gtg gcg egg gec gaa aag gtc gat gcg 2448 
Pro Pro Gin Lys He Leu Glu Val Ala Arg Ala Glu Lys Val Asp Ala 
805 810 815 

ate ggg ctt tec ggg ctg ate acg cca age ctg gac gag atg gtg cat 2496 
He Gly Leu Ser Gly Leu He Thr Pro Ser Leu Asp Glu Met Val His 
820 825 830 

ctg gec gcg gaa atg gag cgc gag ggc ttt gac att ccg ctg ctg ate 2544 
Leu Ala Ala Glu Met Glu Arg Glu Gly Phe Asp He Pro Leu Leu lie 
835 840 845 

ggc ggg gcg acc acg teg aaa gtg cat acg gcg gtg aag ate gec ccc 2592 
Gly Gly Ala Thr Thr Ser Lys Val His Thr Ala Val Lys He Ala Pro 
850 855 860 

gee tac age cgc ggg cag gcg gtt tat gtg etc gat gee age egg gee 2640 
Ala Tyr Ser Arg Gly Gin Ala Val Tyr Val Leu Asp Ala Ser Arg Ala 
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870 
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875 



880 



gtg ggg gtg gtg ggg gcg ctt ttg age ccg aac cag aag gtc gat tac 
Val Gly Val Val Gly Ala Leu Leu Ser Pro Asn Gin Lye Val, Asp Tyr 
885 890 895 

gcg gcg cag ate cgc gcg gac tat gcg cag ate gee gee cgt cat gee 
Ala Ala Gin lie Arg Ala Asp Tyr Ala Gin lie Ala Ala Arg Hie Ala 
900 905 910 



egg tat ate gac tgg acg ccc ttc ttc cat gee tgg gaa ttg aag ggg 
Arg Tyr He Asp Trp Thr Pro Phe Phe His Ala Trp Glu Leu Lys Gly 
965 970 975 



2688 



2736 



cgc gac gag gee gee aag gtg egg ctg cct ttg gec gcg gee egg gec 2784 

Arg Asp Glu Ala Ala Lys Val Arg Leu Pro Leu Ala Ala Ala Arg Ala 
915 920 925 

aat gcg ctg egg etc gac tgg teg ggc tat gee gtg ccc gcg ccg caa 2832 

Asn Ala Leu Arg Leu Asp Trp Ser Gly Tyr Ala Val Pro Ala Pro Gin 

930 935 940 

ttc ctt ggc ccg cgc gtg ate gac gac tgg gat ctg gec gaa gtg gcg 2880 

Phe Leu Gly Pro Arg Val He Asp Asp Trp Asp Leu Ala Glu Val Ala 
945 950 955 960 



2928 



gtc tat ccg egg att etc gat gac gee gaa aag ggc gaa gcg gcg egg 2976 
Val Tyr Pro Arg He Leu Asp Asp Ala Glu Lys Gly Glu Ala Ala Arg 
980 985 * 990 

gca ctt ttc gee gat gee cag gcg atg ctg gcg cag ate att gec gaa 3024 
Ala Leu Phe Ala Asp Ala Gin Ala Met Leu Ala Gin He He Ala Glu 
995 1000 1005 

cgc tgg ttc ace ccg cgc gec gtg gtg ggg ttc tgg ccc gcg cag gcg 3072 
Arg Trp Phe Thr Pro Arg Ala Val Val Gly Phe Trp Pro Ala Gin Ala 
1010 1015 1020 

gtg ggc gac gat ate egg ctt tac ace gac gag age egg ace gaa gac 3120 
Val Gly Asp Asp He Arg Leu Tyr Thr Asp Glu Ser Arg Thr Glu Asp 
1025 1030 1035 ~ 1040 

etc gee act ttc ttc ace ctg cgc cag cag ace ggc aag cgc gaa ggc 3168 
Leu Ala Thr Phe Phe Thr Leu Arg Gin Gin Thr Gly Lys Arg Glu Gly 
1045 1050 1055 

cgc ccg aat gtg get ttg gec gat ttc gtc gcg cct gcg ggc acg gtg 3216 
Arg Pro Asn Val Ala Leu Ala Asp Phe Val Ala Pro Ala Gly Thr Val 
1060 1065 1070 

ccc gat tat ctg ggc ggc ttc gtg gtc ace gcg ggc ccc gag gaa gee 3264 
Pro Asp Tyr Leu Gly Gly Phe Val Val Thr Ala Gly Pro Glu Glu Ala 
1075 1080 1085 

gag ate gec gcg egg ttc gaa get gec aat gac cat tat tec gcg ate 3312 
Glu He Ala Ala Arg Phe Glu Ala Ala Asn Asp His Tyr Ser Ala He 
1090 1095 1100 



ctg gtc aag gcg ctg gee gac cgc ttt gee gaa gec ctg gee gag gec 
Leu Val Lys Ala Leu Ala Asp Arg Phe Ala Glu Ala Leu Ala Glu Ala 
1105 1110 1115 1120 



3360 
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ctg cat cag egg gtg egg cgc gac tat tgg ggc tat gcg ccc gaa gaa 3408 
Leu Hi 8 Gin Arg Val Arg Arg Asp Tyr Trp Gly Tyr Ala Pro Glu Glu 
1125 1130 1135 

age ttc gee ccc gat cag ctg gtg ggc gag ccc tat cgc ggc ate cgc 3456 
Ser Phe Ala Pro Asp Gin Leu Val Gly Glu Pro Tyr Arg Gly He Arg 
1140 1145 " 1150 

ccg gcg ccc ggc tat ccg gec cag ccc gac cac acg gaa aag ctg acg 3504 
Pro Ala Pro Gly Tyr Pro Ala Gin Pro Asp His Thr Glu Lys Leu Thr 
1155 1160 1165 

ctg ttc egg ctg ctt ggg gee gag gee gcg acc ggc gtg cat ctg acc 3552 
Leu Phe Arg Leu Leu Gly Ala Glu Ala Ala Thr Gly Val His Leu Thr 
1170 1175 1180 

gac age atg gcg atg tgg ccc ggc tct teg gtc teg ggg etc tat ate 3600 
Asp Ser Met Ala Met Trp Pro Gly Ser Ser Val Ser Gly Leu Tyr He 
1185 1190 1195 1200 

ggc cat ccg gag gee tat tat ttc ggt ctg gee egg ate gag cag gat 3648 
Gly His Pro Glu Ala Tyr Tyr Phe Gly Leu Ala Arg lie Glu Gin Asp 
1205 1210 1215 

cag gee gec gat tac gee gee cgc aag ggc atg gee ttg gee gag gtg 3696 
Gin Ala Ala Asp Tyr Ala Ala Arg Lys Gly Met Ala Leu Ala Glu Val 
1220 1225 1230 

cag cgc tgg ctg gee ccg gtg ctg ggg teg gec gcg ccc gee gee get 3744 
Gin Arg Trp Leu Ala Pro Val Leu Gly Ser Ala Ala Pro Ala Ala Ala 
1235 1240 1245 



gcg gtg gee gcg tga 
Ala Val Ala Ala 
1250 



3759 



<210> 42 
<211> 1252 
<212> PRT 

<213> Rhodobacter capsulatus 
<400> 42 

Met Leu Thr Gin Thr Leu Pro Arg Ser Ala Ala Phe Ala Ala lie Glu 
15 10 15 

Ala Leu Ser Arg Gin Arg He Leu He Leu Asp Gly Ala Met Gly Thr 
20 25 30 

Gin He Gin Gin Leu Gly Leu Ser Glu Asp Asp Phe Leu Gly His Gly 
35 40 45 

Ser Gly Cys Ala Cys Arg His Ala Thr Asp His Pro Gin Lys Gly Asn 
50 55 60 

Asn Asp Leu Leu Val Leu Thr Gin Pro Gin Ala He Glu Glu He His 
65 70 75 80 

Phe Arg Tyr Ala Met Ala Gly Ala Asp He Val Glu Thr Asn Thr Phe 
85 90 95 



Ser Ala Thr Thr He Ala Gin Ala Asp Tyr Gly Leu Glu Ser Ala Val 
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100 105 110 

Phe Asp Leu Asn Ala Ala Gly Ala Arg Val Ala Arg Ala Ala Met Asp 
115 120 125 

Arg Ala Glu Ala Thr Asp Gly Arg Arg Arg Phe Val Ala Gly Ala Val 
130 135 140 

Gly Pro Thr Asn Arg Thr Ala Ser Leu Ser Pro Asp Val Asn Asp Pro 
145 150 155 160 

Gly Phe Arg Ala Val Thr Phe Asp Asp Leu Arg Thr Ala Tyr Gly Gin 
165 170 175 

Gin Val Arg Gly Leu lie Ala Gly Gly Ala Asp He Leu Leu He Glu 
180 185 190 

Thr He Phe Asp Thr Leu Asn Ala Lys Ala Ala He Phe Ala Cys Phe 
195 200 205 

Glu Ala Phe Ala Glu Arg Gly Glu Arg Leu Pro Val Met He Ser Gly 
210 215 220 

Thr He Thr Asp Ala Ser Gly Arg Thr Leu Ser Gly Gin Thr Pro Thr 
225 230 235 240 

Ala Phe Trp His Ser Val Ala His Ala Arg Pro Phe Thr Val bly Leu 
245 250 255 

Asn Cys Ala Leu Gly Ala Ser Ala Met Arg Pro His Leu Ala Glu Leu 
260 265 270 

Ala Gly Val Ala Pro Cys Ala He Cys Ala Tyr Pro Asn Ala Gly Leu 
275 280 285 

Pro Asn Ala Phe Gly Gin Tyr Asp Glu Thr Pro Asp Arg Thr Ala Ala 
290 295 300 

Gin Val Ala Glu Phe Ala Arg Glu Gly Leu Val Asn Val Val Gly Gly 
305 310 315 320 

Cys Cys Gly Thr Thr Pro Asp His He Arg Ala lie Ala Glu Ala Val 
325 330 335 

Lys Pro Phe Pro Pro Arg Ala Leu Pro Ser Arg Tyr Leu Arg Leu Ser 
340 345 350 

Gly Leu Glu Pro Phe Thr Leu Thr Pro Asp He Pro Phe Val Asn lie 
355 360 365 

Gly Glu Arg Thr Asn Val Thr Gly Ser Ala Arg Phe Arg Lys Met lie 
370 375 380 

Val Ala Arg Asp Tyr Ala Ala Ala Leu Asp Val Ala Arg Asp Gin Val 
385 390 395 400 

Glu Asn Gly Ala Gin lie Leu Asp lie Asn Met Asp Glu Gly Leu lie 
405 410 415 

Asp Ser Gin Ala Ala Met Val Ala Phe Leu Asn Leu Leu Ala Ala Glu 
420 425 430 



WO 03/087386 PCT/EP03/04010 

187 

Pro Asp lie Ala Arg Val Pro Val Met He Asp Ser Ser Lys Trp Glu 
435 440 445 

Val He Glu Ala Gly Leu Lys Cys Val Gin Gly Lys Pro Val Val Abii 
450 455 460 

Ser He Ser Leu Lys Glu Gly Glu Glu He Phe Arg His H1b Ala Ala 
465 470 475 480 

Leu Cys Leu Ala Tyr Gly Ala Ala Val Val Val Met Ala Phe Asp Glu 
485 490 495 

Glu Gly Gin Ala Asp Ser Phe Ala Arg Lys Thr Ser He Cys Ala Arg 
500 505 510 

Ala Tyr Arg He Leu Val Glu Glu He Gly Phe Pro Pro Glu Asp He 
515 520 525 

He Phe Asp Pro Asn Val Phe Ala Val Ala Thr Gly He Glu Glu His 
530 535 540 

Asp Asn Tyr Gly Val Asp Phe He Glu Ala Ala Arg Trp He Arg Ala 
545 550 555 560 

Asn Leu Pro His Ala His Val Ser Gly Gly Val Ser Asn Leu Ser Phe 
565 570 575 

Ser Phe Arg Gly Asn Glu Pro Val Arg Ala Ala Met His Ala Val Phe 
580 585 590 

Leu Tyr His Ala He Arg Ala Gly Met Asp Met Gly He Val Asn Ala 
595 600 605 

Gly Gin Leu Val Val Tyr Asp Gin He Asp Pro Glu Leu Arg Gin Ala 
610 615 620 

Cys Glu Asp Val Val Leu Asn Arg Gin Pro Lys Ser Gly Gly Thr Ala 
625 630 635 640 

Thr Glu Arg Met Leu Glu Val Ala Glu Arg Phe Arg Gly Gly Ala Arg 
645 650 655 

Glu Glu Lys Thr Arg Asp Leu Ala Trp Arg Asp Trp Pro Val Glu Lys 
660 665 ^ 670 

Arg Leu Glu His Ala Leu Val Asn Gly He Thr Glu Phe He Glu Ala 
675 680 685 

Asp Thr Glu Ala Ala Arg Leu Leu Ala Glu Arg Pro Leu His Val He 
690 695 700 

Glu Gly Pro Leu Met Ala Gly Met Asn Val Val Gly Asp Leu Phe Gly 
705 710 715 720 

Ala Gly Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala Arg Val Met 
725 730 735 

Lys Gin Ala Val Ala Val Leu Leu Pro Tyr Met Asp Ala Glu Lys Ala 
740 745 750 

Ala Arg Gly Gly Glu Gly Arg Glu Thr Ala Gly Lys He Leu Met Ala 
755 760 765 
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Thr Val Lys Gly Asp Val His Asp lie Gly Lys Asn He Val Gly Val 
770 775 780 

Val ' Leu Ala Cys Asn Asn Tyr Asp He Val Asp Leu Gly Val Met Val 
785 790 795 , 800 

Pro Pro Gin Lys He Leu Glu val Ala Arg Ala Glu Lys Val Asp Ala 
805 810 815 

He Gly Leu Ser Gly Leu He Thr Pro Ser Leu Asp Glu Met Val His 
820 825 830 

Leu Ala Ala Glu Met Glu Arg Glu Gly Phe Asp He Pro Leu Leu He 
835 840 845 

Gly Gly Ala Thr Thr Ser Lys Val His Thr Ala Val Lys He Ala Pro 
850 855 860 

Ala Tyr Ser Arg Gly Gin Ala Val Tyr Val Leu Asp Ala Ser Arg Ala 
865 870 87S 880 

Val Gly Val Val Gly Ala Leu Leu Ser Pro Asn Gin Lys Val Asp Tyr 
865 890 895 

Ala Ala Gin He Arg Ala Asp Tyr Ala Gin He Ala Ala Arg HIb Ala 
900 905 910 

Arg Asp Glu Ala Ala Lys Val Arg Leu Pro Leu Ala Ala Ala Arg Ala 
915 920 925 

Asn Ala Leu Arg Leu Asp Trp Ser Gly Tyr Ala Val Pro Ala Pro Gin 
930 935 940 

Phe Leu Gly Pro Arg Val He Asp Asp Trp Asp Leu Ala Glu Val Ala 
945 950 955 960 

Arg Tyr He Asp Trp Thr Pro Phe Phe His Ala Trp Glu Leu Lys Gly 
965 970 ~ 975 

Val Tyr Pro Arg He Leu Asp Asp Ala Glu Lys Gly Glu Ala Ala Arg 
980 985 990 

Ala Leu Phe Ala Abp Ala Gin Ala Met Leu Ala Gin He He Ala Glu 
995 1000 1005 

Arg Trp Phe Thr Pro Arg Ala Val Val Gly Phe Trp Pro Ala Gin Ala 
1010 1015 1020 

Val Gly Asp Asp He Arg Leu Tyr Thr Asp Glu Ser Arg Thr Glu Asp 
1025 1030 1035 1040 

Leu Ala Thr Phe Phe Thr Leu Arg Gin Gin Thr Gly Lys Arg Glu Gly 
1045 1050 1055 

Arg Pro Asn Val Ala Leu Ala Asp Phe Val Ala Pro Ala Gly Thr Val 
1060 1065 1070 

Pro Asp Tyr Leu Gly Gly Phe Val Val Thr Ala Gly Pro Glu Glu Ala 
1075 1080 1085 

Glu He Ala Ala Arg Phe Glu Ala Ala Asn Asp His Tyr Ser Ala lie 



( 
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1090 1095 1100 

Leu Val Lys Ala Leu Ala Asp Arg Phe Ala Glu Ala Leu Ala Glu Ala 
HOB 1110 ins 1120 

Leu His Gin Arg Val Arg Arg Asp Tyr Trp Gly Tyr Ala Pro Glu Glu 
1125 1130 U35 

Ser Phe Ala Pro Asp Gin Leu Val Gly Glu Pro Tyr Arg Gly He Arg 
1140 1145 1150 

Pro Ala Pro Gly Tyr Pro Ala Gin Pro Asp His Thr Glu Lys Leu Thr 
1155 1160 H65 

Leu Phe Arg Leu Leu Gly Ala Glu Ala Ala Thr Gly Val His Leu Thr 
1170 1175 H80 

Asp Ser Met Ala Met Trp Pro Gly Ser Ser Val Ser Gly Leu Tyr He 
1185 1190 1195 * 1200 

Gly His Pro Glu Ala Tyr Tyr Phe Gly Leu Ala Arg He Glu Gin Asp 
1205 1210 1215 

Gin Ala Ala Asp Tyr Ala Ala Arg Lys Gly Met Ala Leu Ala Glu Val 
1220 1225 1230 

Gin Arg Trp Leu Ala Pro Val Leu Gly Ser Ala Ala Pro Ala Ala Ala 
1235 1240 1245 

Ala Val Ala Ala 
1250 



<210> 43 

<211> 3798 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1),.(3795) 

<223> RHS24705 

<400> 43 

atg tea ccc gcg etc caa gac ctg teg caa ccc gaa ggt ctg aag aaa 48 

Met Ser Pro Ala Leu Gin Asp Leu Ser Gin Pro Glu Gly Leu Lye Lys 

1 5 10 is 

acc ctg egg gat gag ate aat gee att ctg cag aag agg att atg gtg 96 
Thr Leu Arg Asp Glu He Asn Ala He Leu Gin Lys Arg He Met Val 
20 25 30 

ctg gat gga ggg atg ggg acc atg ate cag egg gag aag eta aac gaa 144 
Leu Asp Gly Gly Met Gly Thr Met He Gin Arg Glu Lys Leu Asn Glu 
35 40 45 

gaa cac ttc cga ggt cag gaa ttt aaa gat cat gee agg ccg ctg aaa 192 
Glu His Phe Arg Gly Gin Glu Phe Lys Asp His Ala Arg Pro Leu Lys 
50 55 60 



ggc aac aat gac att tta agt ata act cag cct gat gtc att tac caa 240 
Gly Asn Asn Asp lie Leu Ser He Thr Gin Pro Asp Val He Tyr Gin 
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S3 70 75 80 

ate cat aag gaa tac ttg ctg get ggg gca gat ate att gaa aca aat 288 
He His Lye Glu Tyr Leu Leu Ala Gly Ala Asp He lie Glu, Thr Asn 
85 90 95 

act ttt age age act agt att gee caa get gac tat ggc ctt gaa cac 336 
Thr Phe Ser Ser Thr Ser He Ala Gin Ala Asp Tyr Gly Leu Glu His 
100 105 no 

ttg gee tac egg atg aac atg tgc tct gca gga gtg gec aga aaa get 384 
Leu Ala Tyr Arg Met Asn Met CyB Ser Ala Gly Val Ala Arg Lys Ala 
115 120 125 

gee gag gag gta act etc cag aca gga att aag agg ttt gtg gca ggg 432 
Ala Glu Glu Val Thr Leu Gin Thr Gly He Lys Arg Phe Val Ala Gly 
130 135 140 

get ctg ggt ccg act aat aag aca etc tct gtg tec cca tct gtg gaa 480 
Ala Leu Gly Pro Thr Asn Lys Thr Leu Ser Val Ser Pro Ser Val Glu 
145 150 155 160 

agg ccg gat tat agg aac ate aca ttt gat gag ctt gtt gaa gca tac 528 
Arg Pro Asp Tyr Arg Asn He Thr Phe Asp Glu Leu Val Glu Ala Tyr 
165 170 175 

caa gag cag gec aaa gga ctt ctg gat ggc ggg gtt gat ate tta etc 576 
Gin Glu Gin Ala LyB Gly Leu Leu Asp Gly Gly Val Asp He Leu Leu 
180 185 190 

att gaa act att ttt gat act gee aat gec aag gca gec ttg ttt gca 624 
He Glu Thr He Phe Asp Thr Ala Asn Ala Lys Ala Ala Leu Phe Ala 
195 200 205 

etc caa aat ctt ttt gag gag aaa tat get ccc egg cct ate ttt att 672 
Leu Gin Asn Leu Phe Glu Glu Lys Tyr Ala Pro Arg Pro He Phe He 
210 215 220 

tea ggg acg ate gtt gat aaa agt ggg egg act ctt tec gga cag aca 720 
Ser Gly Thr He Val Asp Lys Ser Gly Arg Thr Leu Ser Gly Gin Thr 
225 230 235 * 240 

gga gag gga ttt gtc ate age gtg tct cat gga gaa cca etc tgc att 768 
Gly Glu Gly Phe Val He Ser Val Ser His Gly Glu Pro Leu Cys He 
245 250 255 

gga tta aat tgt get ttg ggt gca get gaa atg aga cct ttt att gaa 816 
Gly Leu Asn Cys Ala Leu Gly Ala Ala Glu Met Arg Pro Phe He Glu 
260 265 270 

ata att gga aaa tgt aca aca gec tat gtc etc tgt tat ccc aat gca 864 
He He Gly Lys Cys Thr Thr Ala Tyr Val Leu Cys Tyr Pro Asn Ala 
275 280 285 



ggt ctt ccc aac acc ttt ggt gac tat gat gaa acg cct tct atg atg 
Gly Leu Pro Asn Thr Phe Gly Asp Tyr Asp Glu Thr Pro Ser Met Met 
290 295 300 



912 



gee aag cac eta aag gat ttt get atg gat ggc ttg gtc aat ata gtt 960 
Ala Lys His Leu Lys Asp Phe Ala Met Asp Gly Leu Val Asn He Val 
305 310 315 320 
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gga gga tgc tgt ggg tea aca cca gat cat ate agg gaa att get gaa 1008 
Gly Gly Cys Cys Gly Ser Thr Pro Asp Hia He Arg Glu He Ala Glu 
325 330 335 

get gtg aaa aat tgt aag cct aga gtt cca cct gee act get ttt gaa 1056 
Ala Val Lys Asn Cys LyB Pro Arg Val Pro Pro Ala Thr Ala Phe Glu 
340 345 350 

gga cat atg tta ctg tct ggt eta gag ccc ttc agg att gga ccg tac 1104 
Gly H1b Met Leu Leu Ser Gly Leu Glu Pro Phe Arg He Gly Pro Tyr 
355 360 365 

acc aac ttt gtt aae att gga gag cgc tgt aat gtt gca gga tea agg 1152 
Thr Asn Phe Val Asn He Gly Glu Arg Cys Asn Val Ala Gly Ser Arg 
370 375 380 

aag ttt get aaa etc ate atg gca gga aac tat gaa gaa gee ttg tgt 1200 
Lys Phe Ala Lys Leu He Met Ala Gly Asn Tyr Glu Glu Ala Leu Cys 
385 390 395 400 

gtt gee aaa gtg cag gtg gaa atg gga gee cag gtg ttg gat gtc aac 1248 
Val Ala Lys Val Gin Val Glu Met Gly Ala Gin Val Leu Asp Val Asn 
405 410 415 

atg gat gat ggc atg eta gat ggt cca agt gca atg acc aga ttt tgc 1296 
Met Asp Asp Gly Met Leu Asp Gly Pro Ser Ala Met Thr Arg Phe Cys 
420 425 430 

aac tta att get tec gag cca gac ate gca aag gta cct ttg tgc ate 1344 
Asn Leu He Ala Ser Glu Pro Asp He Ala Lys Val Pro Leu Cys He 
435 440 445 

gac tec tec aat ttt get gtg att gaa get ggg tta aag tgc tgc caa 1392 
Asp Ser Ser Asn Phe Ala Val He Glu Ala Gly Leu Lys Cys Cys Gin 
450 455 460 

ggg aag tgc att gtc aat age att agt ctg aag gaa gga gag gac gac 1440 
Gly Lys Cya He Val Asn Ser He Ser Leu Lys Glu Gly Glu Asp Asp 
465 470 475 480 

ttc ttg gag aag gee agg aag att aaa aag tat gga get get atg gtg 1488 
Phe Leu Glu Lys Ala Arg Lys He Lys Lys Tyr Gly Ala Ala Met Val 
405 490 495 

gtc atg get ttt gat gaa gaa gga cag gca aca gaa aca gac aca aaa 1536 
Val Met Ala Phe Asp Glu Glu Gly Gin Ala Thr Glu Thr Asp Thr Lys 
500 505 510 

ate aga gtg tgc acc egg gee tac cat ctg ctt gtg aaa aaa ctg ggc 1584 
He Arg Val Cys Thr Arg Ala Tyr His Leu Leu Val Lys Lys Leu Gly 
S15 520 525 

ttt aat cca aat gac att att ttt gac cct aat ate eta acc att ggg 1632 
Phe Asn Pro Asn Asp He He Phe Asp Pro Asn He Leu Thr He Gly 
530 535 540 

act gga atg gag gaa cac aac ttg tat gee att aat ttt ate cat gca 1680 
Thr Gly Met Glu Glu His Asn Leu Tyr Ala He Asn Phe lie His Ala 
545 550 555 560 

aca aaa gtc att aaa gaa aca tta cct gga gee aga ata agt gga ggt 1728 
Thr Lys Val He Lys Glu Thr Leu Pro Gly Ala Arg He Ser Gly Gly 
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565 570 . 575 

ctt tec aac ttg tec ttc tec ttc cga gga atg gaa gee att cga gaa 1776 
Leu Ser Aen Leu Ser Phe Ser Phe Arg Gly Met Glu Ala lie <Arg Glu 
580 585 590 

gca atg cat ggg gtt ttc ctt tac cat gca ate aag tct ggc atg gac 1824 
Ala Met His Gly Val Phe Leu Tyr His Ala He Lys Ser Gly Met Asp 
595 600 605 

atg ggg ata gtg aat get gga aac etc cct gtg tat gat gat ate cat 1872 
Met Gly He Val Asn Ala Gly Asn Leu Pro Val Tyr Asp Asp He His 
610 615 620 

aag gaa ctt ctg cag etc tgt gaa gat etc ate tgg aat aaa gac cct 1920 
Lys Glu Leu Leu Gin Leu Cys Glu Asp Leu He Trp Asn Lys Asp Pro 
625 630 635 640 

gag gee act gag aag etc tta cgt tat gec cag act caa ggc aca gga 1968 
Glu Ala Thr Glu Lys Leu Leu Arg Tyr Ala Gin Thr Gin Gly Thr Gly 
645 650 655 

ggg aag aaa gtc att cag act gat gag tgg aga aat ggc cct gtc gaa 2016 
Gly Lys Lys Val He Gin Thr Asp Glu Trp Arg Asn Gly Pro Val Glu 
660 665 670 

gaa cgc ctt gag tat gee ctt gtg aag ggc att gaa aaa cat att att 2064 
Glu Arg Leu Glu Tyr Ala Leu Val Lys Gly He Glu Lys His, He He 
675 680 685 

gag gat act gag gaa gee agg tta aac caa aaa aaa tat ccc cga cct 2112 
Glu Asp Thr Glu Glu Ala Arg Leu Asn Gin Lys Lys Tyr Pro Arg Pro 
690 695 700 

etc aat ata att gaa gga ccc ctg atg aat gga atg aaa att gtt ggt 2160 
Leu Asn He He Glu Gly Pro Leu Met Asn Gly Met Lys He Val Gly 
70S 710 715 720 

gat ctt ttt gga get gga aaa atg ttt eta cct cag gtt ata aag tea 2208 
Asp Leu Phe Gly Ala Gly Lys Met Phe Leu Pro Gin Val He Lys Ser 
725 730 735 

gee egg gtt atg aag aag get gtt ggc cac ctt ate cct ttc atg gaa 2256 
Ala Arg Val Met Lys Lys Ala Val Gly His Leu He Pro Phe Met Glu 
740 745 750 

aaa gaa aga gaa gaa ace aga gtg ctt aac ggc aca gta gaa gaa gag 2304 
Lys Glu Arg Glu Glu Thr Arg Val Leu Asn Gly Thr Val Glu Glu Glu 
755 760 76S 

gac cct tac cag ggc ace ate gtg ctg gee act gtt aaa ggc gac gtg 2352 
Asp Pro Tyr Gin Gly Thr He Val Leu Ala Thr Val Lys Gly Asp Val 
770 775 780 

cac gac ata ggc aag aac ata gtt gga gta gtc ctt ggc tgc aat aat 2400 
His Asp He Gly Lys Asn He Val Gly Val Val Leu Gly Cys Asn Asn 
785 790 795 800 

ttc cga gtt att gat tta gga gtc atg act cca tgt gat aag ata ctg 2448 
Phe Arg Val He Asp Leu Gly Val Met Thr Pro Cys Asp Lys He Leu 
805 810 815 
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aaa get get ctt gac cac aaa gca gat ata att ggc ctg tea gga etc 
Lye Ala Ala Leu Asp His Lys Ala Asp He He Gly Leu Ser Gly Leu 
820 825 830 



2496 



ate act cct tec ctg gat gaa atg att ttt gtt gee aag gaa atg gag 
lie Thr Pro Ser Leu Asp Glu Met He Phe Val Ala Lys Glu Met Glu 
835 840 845 



2544 



aga tta get ata agg att cca ttg ttg att gga gga gca ace act tea 
Arg Leu Ala He Arg He Pro Leu Leu He Gly Gly Ala Thr Thr Ser 
850 855 860 



2592 



aaa acc cac aca gca gtt aaa ata get ccg aga tac agt gca cct gta 
Lys Thr His Thr Ala Val Lys He Ala Pro Arg Tyr Ser Ala Pro Val 
865 870 875 880 



2640 



ate cat gtc ctg gac gcg tec aag agt gtg gtg gtg tgt tec cag ctg 
He His Val Leu Asp Ala Ser Lys Ser Val Val Val Cys Ser Gin Leu 
885 890 895 



2688 



tta gat gaa aat eta aag gat gaa tac ttt gag gaa ate atg gaa gaa 
Leu Asp Glu Asn Leu Lys Asp Glu Tyr Phe Glu Glu He Met Glu Glu 
900 905 910 



2736 



tat gaa gat att aga cag gac cat tat gag tct etc aag gag agg aga 
Tyr Glu Asp He Arg Gin Asp His Tyr Glu Ser Leu Lys Glu Arg Arg 
915 920 925 



2784 



tac tta ccc tta agt caa gee aga aaa agt ggt ttc caa atg gat tgg 
Tyr Leu Pro Leu Ser Gin Ala Arg Lys Ser Gly Phe Gin Met Asp Trp 
930 935 940 



2832 



ctg tct gaa cct cac cca gtg aag ccc acg ttt att ggg acc cag gtc 
Leu Ser Glu Pro His Pro Val Lys Pro Thr Phe He Gly Thr Gin Val 
945 950 955 * 960 



2880 



ttt gaa gac tat gac ctg cag aag ctg gtg gac tac att gac tgg aag 
Phe Glu Asp Tyr Asp Leu Gin Lys Leu Val Asp Tyr He Asp Trp Lys 
965 970 " 975 



2928 



cct ttc ttt gat gtc tgg cag etc egg ggc aag tac ccg aat cga ggc 
Pro Phe Phe Asp Val Trp Gin Leu Arg Gly Lys Tyr Pro Asn Arg Gly 
980 985 990 



2976 



ttt ccc aag ata ttt aac gac aaa aca gta ggt gga gag gee agg aag 
Phe Pro Lys He Phe Asn Asp Lys Thr Val Gly Gly Glu Ala Arg Lys 
995 1000 1005 



3024 



gtc tac gat gat gee cac aat atg ctg aac aca ctg att agt caa aag 
Val Tyr Asp Asp Ala His Asn Met Leu Asn Thr Leu He Ser Gin Lys 
1010 1015 1020 



3072 



aaa etc egg gee egg ggt gtg gtt ggg ttc tgg cca gca cag agt ate 
Lys Leu Arg Ala Arg Gly Val Val Gly Phe Trp Pro Ala Gin Ser He 
1025 1030 1035 1040 



3120 



caa gac gac att cac ctg tac gcg gag get get gtg ccc cag get gca 
Gin Asp Asp He His Leu Tyr Ala Glu Ala Ala Val Pro Gin Ala Ala 
1045 1050 1055 



3168 



gag ccc ata gee acc ttc tat ggg tta agg caa cag get gag aag gac 
Glu Pro He Ala Thr Phe Tyr Gly Leu Arg Gin Gin Ala Glu Lys Asp 



3216 
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1065 



1070 



tct gcc age acg gag cca tac tac tgc etc tea gac ttc ate get ccc 3264 
Ser Ala Ser Thr Glu Pro Tyr Tyr Cys Leu Ser Asp Phe. He Ala Pro 
1075 1080 1085 

ttg cat tct ggc ate cgt gac tac ctg ggc ctg ttt gcc gtt gcc tgc 3312 
Leu His Ser Gly He Arg Aep Tyr Leu Gly Leu Phe Ala Val Ala Cys 
1090 1095 1100 

ttt ggg gta gaa gag ctg age aag gcc tat gag gat gat ggt gac gac 3360 
Phe Gly Val Glu Glu Leu Ser Lys Ala Tyr Glu Asp Asp Gly Asp Asp 
1105 1110 1115 * 1120 

tac age age ate atg gtc aag gcg ctg ggg gac egg ctg gca gag gcc 3408 
Tyr Ser Ser He Met Val Lys Ala Leu Gly Asp Arg Leu Ala Glu Ala 
1125 1130 " 1135 

ttt gca gaa gag etc cat gaa aga gtt cgc cga gaa ctg tgg gcc tac 3456 
Phe Ala Glu Glu Leu His Glu Arg Val Arg Arg Glu Leu Trp Ala Tyr 
1140 1145 1150 

tgt ggc agt gag cag ctg gac gtc gca gac ctg cgc agg ctg egg tac 3504 
Cys Gly Ser Glu Gin Leu Asp Val Ala Abp Leu Arg Arg Leu Arg Tyr 
1155 1160 1165 

aag ggc ate cgc ccg get cct ggc tac ccc age cag ccc gac cac ace 3552 
Lys Gly He Arg Pro Ala Pro Gly Tyr Pro Ser Gin Pro Asp His Thr 
1170 1175 1180 

gag aag etc acc atg tgg aga ctt gca gac ate gag cag tct aca ggc 3600 
Glu Lys Leu Thr Met Trp Arg Leu Ala Asp He Glu Gin Ser Thr Gly 
1185 1190 1195 1200 

att agg tta aca gaa tea tta gca atg gca cct get tea gca gtc tea 3648 
He Arg Leu Thr Glu Ser Leu Ala Met Ala Pro Ala Ser Ala Val Ser 
1205 1210 1215 

ggc etc tac ttc tec aat ttg aag tec aaa tat ttt get gtg ggg aag 3696 
Gly Leu Tyr Phe Ser Asn Leu Lys Ser Lys Tyr Phe Ala Val Gly Lys 
1220 1225 1230 

att tec aag gat cag gtt gag gat tat gca ttg agg aag aac ata tct 3744 
He Ser Lys Asp Gin Val Glu Asp Tyr Ala Leu Arg Lys Asn He Ser 
1235 1240 1245 

gtg get gag gtt gag aaa tgg ctt gga ccc att ttg gga tat gat aca 3792 
Val Ala Glu Val Glu Lys Trp Leu Gly Pro He Leu Gly Tyr Asp Thr 
1250 1255 1260 



gac taa 

Asp 

1265 



3798 



<210> 44 
<211> 1265 
<212> PRT 

<213> Homo sapiens 



<400> 44 

Met Ser Pro Ala Leu Gin Asp Leu Ser Gin Pro Glu Gly Leu Lys Lys 
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1 



5 



10 



15 



Thr Leu Arg Asp Glu lie Asn Ala He Leu Gin Lys Arg He Met Val 
20 25 30 

Leu Asp Gly Gly Met Gly Thr Met He Gin Arg Glu LyB Leu Asn Glu 
35 40 45 

Glu His Phe Arg Gly Gin Glu Phe Lys Asp His Ala Arg Pro Leu Lys 
50 55 60 

Gly Asn Asn Asp He Leu Ser He Thr Gin Pro Asp Val He Tyr Gin 
65 70 75 80 

He His Lys Glu Tyr Leu Leu Ala Gly Ala Asp He He Glu Thr Asn 
85 90 95 

Thr Phe Ser Ser Thr Ser He Ala Gin Ala Asp Tyr Gly Leu Glu His 
100 105 110 

Leu Ala Tyr Arg Met Asn Met Cys Ser Ala Gly Val Ala Arg Lys Ala 
115 120 125 

Ala Glu Glu Val Thr Leu Gin Thr Gly He Lys Arg Phe Val Ala Gly 
130 135 140 

Ala Leu Gly Pro Thr Asn Lys Thr Leu Ser Val Ser Pro Ser Val Glu 
145 150 155 160 

Arg Pro Asp Tyr Arg Asn He Thr Phe Asp Glu Leu Val Glu Ala Tyr 
165 170 175 

Gin Glu Gin Ala Lys Gly Leu Leu Asp Gly Gly Val Asp He Leu Leu 
180 185 190 

He Glu Thr He Phe Asp Thr Ala Asn Ala Lys Ala Ala Leu Phe Ala 
195 200 205 

Leu Gin Asn Leu Phe Glu Glu Lys Tyr Ala Pro Arg Pro He Phe He 
210 215 220 

Ser Gly Thr He Val Asp Lys Ser Gly Arg Thr Leu Ser Gly Gin Thr 
225 230 235 240 

Gly Glu Gly Phe Val He Ser Val Ser His Gly Glu Pro Leu Cys He 
245 250 255 

Gly Leu Asn Cys Ala Leu Gly Ala Ala Glu Met Arg Pro Phe He Glu 
260 265 ~ 270 

He He Gly Lys Cys Thr Thr Ala Tyr Val Leu Cys Tyr Pro Asn Ala 
275 280 285 

Gly Leu Pro Asn Thr Phe Gly Asp Tyr Asp Glu Thr Pro Ser Met Met 
290 295 300 

Ala Lys His Leu Lys Asp Phe Ala Met Asp Gly Leu Val Asn He Val 
305 310 315 320 

Gly Gly Cys Cys Gly Ser Thr Pro Asp His He Arg Glu He Ala Glu 



325 



330 



335 
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Ala Val Lys Asn Cys Lys Pro Arg Val Pro Pro Ala Thr Ala Phe Glu 
340 345 350 

Gly His Met Leu Leu Ser Gly Leu Glu Pro Phe Arg He Gly. Pro Tyr 
355 360 365 

Thr Asn Phe Val Asn He Gly Glu Arg Cys Asn Val Ala Gly Ser Arg 
370 375 360 

Lys Phe Ala Lys Leu He Met Ala Gly. Asn Tyr Glu Glu Ala Leu Cys 
385 390 395 400 

Val Ala Lys Val Gin Val Glu Met Gly Ala Gin Val Leu Asp Val Asn 
405 410 415 

Met Asp Asp Gly Met Leu Asp Gly Pro Ser Ala Met Thr Arg Phe Cye 
420 425 430 

Asn Leu He Ala Ser Glu Pro Asp He Ala Lys Val Pro Leu Cys He 
435 440 445 

Asp Ser Ser Asn Phe Ala Val He Glu Ala Gly Leu Lys Cys Cye Gin 
450 455 460 

Gly Lys Cys He Val Asn Ser He Ser Leu Lys Glu Gly Glu Asp Asp 
465 470 475 480 

Phe Leu Glu Lys Ala Arg Lys He Lys Lys Tyr Gly Ala Ala Met Val 
485 490 495 

Val Met Ala Phe Asp Glu Glu Gly Gin Ala Thr Glu Thr Asp Thr Lye 
500 505 510 

He Arg Val Cys Thr Arg Ala Tyr His Leu Leu Val Lys LyB Leu Gly 
515 520 525 

Phe Asn Pro Asn Asp He He Phe Asp Pro Asn He Leu Thr He Gly 
530 535 540 

Thr Gly Met Glu Glu His Asn Leu Tyr Ala He Asn Phe He His Ala 
545 550 555 560 

Thr Lys Val He Lys Glu Thr Leu Pro Gly Ala Arg He Ser Gly Gly 
565 570 575 

Leu Ser Asn Leu Ser Phe Ser Phe Arg Gly Met Glu Ala He Arg Glu 
580 585 590 

Ala Met His Gly Val Phe Leu Tyr His Ala He Lys Ser Gly Met Asp 
595 600 60S 

Met Gly He Val Asn Ala Gly Asn Leu Pro Val Tyr Asp Asp He His 
610 615 620 

Lys Glu Leu Leu Gin Leu Cys Glu Asp Leu He Trp Asn Lys Asp Pro 
625 630 635 * 640 

Glu Ala Thr Glu Lys Leu Leu Arg Tyr Ala Gin Thr Gin Gly Thr Gly 
645 650 655 

Gly Lys Lys Val He Gin Thr Asp Glu Trp Arg Asn Gly Pro Val Glu 

665 " 670 
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Glu Arg Leu Glu Tyr Ala Leu Val Lys Gly He Glu Lys His lie He 
675 680 685 

Glu Asp Thr Glu Glu Ala Arg Leu Asn Gin Lys Lys Tyr Pro Arg Pro 
690 695 700 

Leu Asn He He Glu Gly Pro Leu Met Asn Gly Met Lys He Val Gly 
70S 710 715 720 

Asp Leu Phe Gly Ala Gly Lys Met Phe Leu Pro Gin Val He Lys Ser 
725 730 735 

Ala Arg Val Met Lys Lys Ala Val Gly His Leu He Pro Phe Met Glu 
740 745 750 

Lys Glu Arg Glu Glu Thr Arg Val Leu Asn Gly Thr Val Glu Glu Glu 
755 760 765 

Asp Pro Tyr Gin Gly Thr He Val Leu Ala Thr Val Lys Gly Asp Val 
770 775 780 

His Asp He Gly Lys Asn He Val Gly Val Val Leu Gly Cys Asn Asn 
785 790 795 ' 800 

Phe Arg Val He Asp Leu Gly Val Met Thr Pro Cys Asp Lys He Leu 
805 810 815 

Lys Ala Ala Leu Asp His Lys Ala Asp He He Gly Leu Ser Gly Leu 
820 825 830 

He Thr Pro Ser Leu Asp Glu Met He Phe Val Ala Lys Glu Met Glu 
635 840 845 

Arg Leu Ala He Arg He Pro Leu Leu He Gly Gly Ala Thr Thr Ser 
850 855 860 

Lys Thr His Thr Ala Val Lys He Ala Pro Arg Tyr Ser Ala Pro Val 
865 870 875 880 

He His Val Leu Asp Ala Ser Lys Ser Val Val Val Cys Ser Gin Leu 
885 890 895 

Leu Asp Glu Asn Leu Lys Asp Glu Tyr Phe Glu Glu He Met Glu Glu 
900 90S 910 

Tyr Glu Asp He Arg Gin Asp His Tyr Glu Ser Leu Lys Glu Arg Arg 
915 920 925 

Tyr Leu Pro Leu Ser Gin Ala Arg Lys Ser Gly Phe Gin Met Asp Trp 
930 935 940 

Leu Ser Glu Pro His Pro Val Lys Pro Thr Phe He Gly Thr Gin Val 
945 950 955 960 

Phe Glu Asp Tyr Asp Leu Gin Lys Leu Val Asp Tyr He Asp Trp Lys 
965 970 975 

Pro Phe Phe Asp Val Trp Gin Leu Arg Gly Lys Tyr Pro Asn Arg Gly 
980 985 990 

Phe Pro Lys lie Phe Asn Asp Lys Thr Val Gly Gly Glu Ala Arg Lys 
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995 1000 1005 

Val Tyr Asp Asp Ala Hie Asn Met Leu Asn Thr Leu He Ser Gin Lys 
1010 1015 1020 

Lys Leu Arg Ala Arg Gly Val Val Gly Phe Trp Pro Ala Gin Ser He 
1025 1030 1035 1040 

Gin Asp Asp He His Leu Tyr Ala Glu Ala Ala Val Pro Gin Ala Ala 
1045 1050 ,1055 

Glu Pro He Ala Thr Phe Tyr Gly Leu Arg Gin Gin Ala Glu Lys Asp 
1060 1065 1070 

Ser Ala Ser Thr Glu Pro Tyr Tyr Cys Leu Ser Asp Phe He Ala Pro 
1075 1080 1085 

Leu His Ser Gly He Arg Asp Tyr Leu Gly Leu Phe Ala Val Ala Cys 
1090 1095 1100 

Phe Gly Val Glu Glu Leu Ser Lys Ala Tyr Glu Asp Asp Gly Asp Abp 
H05 1110 1H5 1120 

Tyr Ser Ser He Met Val Lys Ala Leu Gly Asp Arg Leu Ala Glu Ala 
1125 1130 1135 

Phe Ala Glu Glu Leu His Glu Arg Val Arg Arg Glu Leu Trp Ala Tyr 
1140 1145 1150 

Cys Gly Ser Glu Gin Leu Asp Val Ala Asp Leu Arg Arg Leu Arg Tyr 
H55 1160 1165 

Lys Gly He Arg Pro Ala Pro Gly Tyr Pro Ser Gin Pro Asp His Thr 
1170 1175 1180 

Glu Lys Leu Thr Met Trp Arg Leu Ala Asp He Glu Gin Ser Thr Gly 
H85 1190 1195 1200 

He Arg Leu Thr Glu Ser Leu Ala Met Ala Pro Ala Ser Ala Val Ser 
1205 1210 1215 

Gly Leu Tyr Phe Ser Asn Leu Lys Ser Lys Tyr Phe Ala Val Gly Lys 
1220 1225 1230 

He Ser Lys Asp Gin Val Glu Asp Tyr Ala Leu Arg Lys Asn He Ser 
1235 1240 1245 

Val Ala Glu Val Glu Lys Trp Leu Gly Pro He Leu Gly Tyr Asp Thr 
1250 1255 1260 

Asp 
1265 



<210> 45 
<211> 3681 
<212> DNA 

<213> Vibrio fisheri 

<220> 

<221> CDS 

<222> (1)..(3678) 
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<223> AB039955 
<400> 45 

gtg gca gga age aat ata aaa gta caa ata gaa aag caa ctt tea gag 48 
Val Ala Gly Ser Asn He Lye Val Gin He Glu Lys Gin Leu Ser Glu 
1 5 10 15 

cga att tta ttg att gat ggt ggt atg ggc ace atg att caa ggt tat 96 
Arg He Leu Leu He Asp Gly Gly Met Gly Thr Met He Gin Gly Tyr 
20 25 30 

aag ttt gaa gag aaa gat tat aga ggg gga cgc ttt aat caa tgg cat 144 
Lys Phe Glu Glu LyB Asp Tyr Arg Gly Gly Arg Phe Asn Gin Tip His 
35 40 45 

tgt gat ctt aaa ggt aac aat gat tta tta gtt ctt tea caa cca caa 192 
Cys Asp Leu Lys Gly Asn Asn Asp Leu Leu Val Leu Ser Gin Pro Gin 
50 55 60 

att ata aga gat ata cac gaa gee tat tta gaa get ggt get gat ate 240 
He He Arg Asp He His Glu Ala Tyr Leu Glu Ala Gly Ala Asp He 
65 70 75 80 

ctt gaa act aat ace ttt aat gca aca act att get atg get gat tat 288 
Leu Glu Thr Asn Thr Phe Asn Ala Thr Thr He Ala Met Ala Asp Tyr 
85 90 95 

gat atg gaa age ctt agt gaa gag att aac ttt gaa gca gca aag ctt 336 
Asp Met Glu Ser Leu Ser Glu Glu He Asn Phe Glu Ala Ala Lys Leu 
100 105 no 



get cgt gaa gtt gca gat aaa tgg aca gaa aaa aca cca aac aaa cct 
Ala Arg Glu Val Ala Asp Lys Trp Thr Glu Lys Thr Pro Asn Lys Pro 
115 120 125 



384 



cgc tat gta gca gga gtg ctt gga cca aca aat cga act tgt tct att 432 
Arg Tyr Val Ala Gly Val Leu Gly Pro Thr Asn Arg Thr Cys Ser He 
130 135 140 

tct cca gac gta aat gac cct ggc ttt cgt aat gta teg ttt gat gaa 480 
Ser Pro Asp Val Asn Asp Pro Gly Phe Arg Asn Val Ser Phe Asp Glu 
145 150 155 160 

tta gtc gaa get tat tea gag tea act cga gca ctt att aga ggt ggt 528 
Leu Val Glu Ala Tyr Ser Glu Ser Thr Arg Ala Leu He Arg Gly Gly 
165 170 175 

tea gat ctt ate etc ate gaa act ata ttt gat aca tta aat get aaa 576 
Ser Asp Leu He Leu He Glu Thr He Phe Asp Thr Leu Asn Ala Lys 
180 185 190 

gcg tgt tct ttt get gtt gaa tct gtt ttt gaa gag ctt ggt att act 624 
Ala Cys Ser Phe Ala Val Glu Ser Val Phe Glu Glu Leu Gly He Thr 
195 200 205 

ttg cct gtt atg att tea ggg acc att acc gat gca tea gga aga aca 672 
Leu Pro Val Met He Ser Gly Thr He Thr Asp Ala Ser Gly Arg Thr 
210 215 220 

tta teg ggg caa aca aca gaa get ttt tat aat gca tta aga cat gta 720 
Leu Ser Gly Gin Thr Thr Glu Ala Phe Tyr Asn Ala Leu Arg His Val 
225 230 235 " 240 
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aaa cct att tct ttt ggt ctt aac tgt gca ctt ggt cct gat gaa tta 768 
Lyu Pro lie Ser Phe Gly Leu Asn Cys Ala Leu Gly Pro Asp Glu Leu 
245 250 ,255 

cgt gaa tat gta age gag ctt tea cgt att tct gaa tgt tat gtt tct B16 
Arg Glu Tyr Val Ser Glu Leu Ser Arg He Ser Glu Cys Tyr Val Ser 
260 265 270 

gcg cac cca aac get ggt ttg cct aat gca ttt ggt gag tat gat tta 864 
Ala Hie Pro Asn Ala Gly Leu Pro Asn Ala Phe Gly Glu Tyr Asp Leu 
275 280 285 

tct ccc gaa gat atg get gag cat gtt gcg gaa tgg gca age age gga 912 
Ser Pro Glu Asp Met Ala Glu His Val Ala Glu Trp Ala Ser Ser Gly 
290 295 300 

ttt tta aat ctt att ggt ggg tgt tgt ggc acc act cct gaa cat att 960 
Phe Leu Asn Leu He Gly Gly Cys Cys Gly Thr Thr Pro Glu His He 
305 310 315 320 

cgt caa atg get tta gtt gtt gaa ggt gtg aaa cct cga caa tta cct 1008 
Arg Gin Met Ala Leu Val Val Glu Gly Val Lys Pro Arg Gin Leu Pro 
325 330 335 

gaa tta ccc gta get tgt cgt ctt tec gga tta gag cct tta aca ata 1056 
Glu Leu Pro Val Ala Cys Arg Leu Ser Gly Leu Glu Pro Leu Thr He 
340 345 350 

gaa aaa gat tct ttg ttt att aat gtt ggt gaa cgt aca aat gtt act 1104 
Glu Lys Asp Ser Leu Phe He Asn Val Gly Glu Arg Thr ABn Val Thr 
355 360 365 

gga tct gca cgt ttt aaa cgc tta att aaa gaa gag ctt tat gac gaa 1152 
Gly Ser Ala Arg Phe Lys Arg Leu He Lys Glu Glu Leu Tyr Asp Glu 
370 375 380 

gca eta agt gtt get caa gag caa gtt gaa aac ggt get caa att ate 1200 
Ala Leu Ser Val Ala Gin Glu Gin Val Glu Asn Gly Ala Gin He He 
385 390 395 400 

gat ate aac atg gat gaa ggc atg ctt gat get gaa gca tgt atg gtt 1248 
Asp He Asn Met Asp Glu Gly Met Leu Asp Ala Glu Ala Cys Met Val 
405 410 415 

cgt ttt tta aat ctt tgt gca tea gaa cct gaa ata tct aaa gta cca 1296 
Arg Phe Leu Asn Leu Cys Ala Ser Glu Pro Glu He Ser Lys Val Pro 
420 425 430 

gtg atg gtt gat tct tct aaa tgg gaa gta att gaa get gga tta aag 1344 
Val Met Val Asp Ser Ser Lys Trp Glu Val He Glu Ala Gly Leu Lys 
435 440 445 

tgt att caa ggt aag ggg ata gtt aat tea ate tct tta aag gaa ggc 1392 
Cys He Gin Gly Lys Gly He Val Asn Ser He Ser Leu Lys Glu Gly 
450 455 460 

aaa gaa aag ttt gta cat caa gee aag tta ata cgt cgt tat ggt get 1440 
Lys Glu Lys Phe Val His Gin Ala Lys Leu He Arg Arg Tyr Gly Ala 
465 470 475 480 

gca gtg ate gtt atg get ttt gat gaa gtt ggc caa gcg gac act egg 1488 
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Ala Val He Val Met Ala Phe Asp Glu Val Gly Gin Ala Asp Thr Arg 
485 490 495 

gag cgt aaa att gaa att tgt acc aat gcc tac aat att tta gtt gat 1536 
Glu Arg Lys He Glu He Cys Thr Asn Ala Tyr Asn He Leu Val Asp 
500 505 510 

gaa gtt ggc ttc cca cct gaa gat att att ttt gac cct aat att ttt 1584 
Glu Val Gly Phe Pro Pro Glu Asp He He Phe Asp Pro Asn He Phe 
515 520 525 

gcg gtt get aca ggt ate gat gaa cat aat aac tat gca gta gac ttt 1632 
Ala Val Ala Thr Gly He Asp Glu His Asn Asn Tyr Ala Val Asp Phe 
530 535 540 

att gaa gcc gtt ggt gat ata aag cga acg ctt cct cat gca atg att 1680 
He Glu Ala Val Gly Asp He Lys Arg Thr Leu Pro His Ala Met He 
545 550 555 560 

tea ggt ggt gtt tct aac gtc tct ttt tct ttc cgt gga aat aac tac 1728 
Ser Gly Gly Val Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Asn Tyr 
565 570 575 

gtt cgt gaa get ate cat gcc gta ttt tta tat cac tgt ttt aaa aat 1776 
Val Arg Glu Ala He His Ala Val Phe Leu Tyr His Cys Phe Lys Asn 
580 585 590 

ggt atg gat atg ggc ate gta aat gcg ggg cag ctg gaa ata tat gat 1824 
Gly Met Asp Met Gly He Val Asn Ala Gly Gin Leu Glu He Tyr Asp 
595 600 60S 

aac gta cca gaa gat ctg cgt gaa gcg gtt gaa gat gtg gta ttg aat 1872 
Asn Val Pro Glu Asp Leu Arg Glu Ala Val Glu Asp Val Val Leu Asn 
610 615 620 

cgt cga gat gat tct acg gag cgt tta ctt gat att gca act gag tat 1920 
Arg Arg Asp Asp Ser Thr Glu Arg Leu Leu Asp He Ala Thr Glu Tyr 
625 630 635 640 

tta gaa cga get gtt ggt aaa gtt gaa gat aaa tct get tta gag tgg 1968 
Leu Glu Arg Ala Val Gly Lys Val Glu Asp Lys Ser Ala Leu Glu Trp 
645 650 655 

cgt gac tgg cct gtt gaa aaa cgt ctt gag cat tct eta gtg aag ggg 2016 
Arg Asp Trp Pro Val Glu Lys Arg Leu Glu His Ser Leu Val Lys Gly 
660 665 670 

ata aca gag ttt att gtc gaa gat aca gaa gaa gca cga ate aat gca 2064 
He Thr Glu Phe He Val Glu Asp Thr Glu Glu Ala Arg He Asn Ala 
675 680 685 

gaa aga cca ata gag gta att gaa ggg cca ttg atg gac gga atg aac 2112 
Glu Arg Pro He Glu Val He Glu Gly Pro Leu Met Asp Gly Met Asn 
690 695 700 

gtc gtt ggt gat ctt ttt ggg gaa gga aaa atg ttc ctt ccc caa gta 2160 
Val Val Gly Asp Leu Phe Gly Glu Gly Lys Met Phe Leu Pro Gin Val 
705 710 715 720 



gta aag tct get cgt gta atg aaa caa get gtt get cat tta gaa ccg 
Val Lys Ser Ala Arg Val Met Lys Gin Ala Val Ala His Leu Glu Pro 
725 730 735 



2208 
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ttt att aat gcg tct aaa gaa gtt gga gca aca aac ggt aaa ata ctt 2256 
Phe lie Asn Ala Ser Lys Glu Val Oly Ala Thr Asn Gly Lys lie Leu 
740 745 750. 

tta gca aca gta aaa ggt gat gtt cat gat att ggt aag aat ate gtt 2304 
Leu Ala Thr Val Lys Gly Asp Val His Asp lie Gly Lys Asn lie Val 
755 760 765 

ggc gtg gtt tta cag tgt aat aac tat. gaa ata att gat ctt ggt gtc 2352 
Gly Val Val Leu Gin Cys Asn Asn Tyr Glu He He Asp Leu Gly Val 
770 775 7B0 

atg gtc tct tgt gaa act ate tta aaa gta gec aaa gaa gaa aat gta 2400 
Met Val Ser Cys Glu Thr He Leu Lys Val Ala Lys Glu Glu Asn Val 
785 790 795 600 

gac ate att ggt tta tct gga tta ata aca cca tea tta gat gaa atg 2448 
Asp He He Gly Leu Ser Gly Leu lie Thr Pro Ser Leu Asp Glu Met 
805 810 815 

gtc cat gtt get aaa gag atg gaa cga caa ggg ttt gat tta cca ttg 2496 
Val His Val Ala Lys Glu Met Glu Arg Gin Gly Phe Asp Leu Pro Leu 
820 825 830 

ttg att ggt gga gca aca act tea aaa gca cat aca gcg gta aaa att 2544 
Leu lie Gly Gly Ala Thr Thr Ser Lys Ala His Thr Ala Val Lys He 
835 840 845 

gaa caa aac tat tct caa cct gtt gtg tac gtt aat aat get tct cga 2592 
Glu Gin Asn Tyr Ser Gin Pro Val Val Tyr Val Asn Asn Ala Ser Arg 
850 855 860 

get gta ggt gta tgt act tea tta ctt tea aat gaa eta aaa cct tct 2640 
Ala Val Gly Val Cys Thr Ser Leu Leu Ser Asn Glu Leu Lys Pro Ser 
865 870 875 880 

ttt gtt gag aag eta gat att gat tac gaa cgt gtt aga gag cag cat 2688 
Phe Val Glu Lys Leu Asp He Asp Tyr Glu Arg Val Arg Glu Gin His 
885 890 895 

agt cgt aaa caa ccg cga act aag cct gtg act tta gag gtt get cga 2736 
Ser Arg Lys Gin Pro Arg Thr Lys Pro Val Thr Leu Glu Val Ala Arg 
900 905 910 



gcg aat aaa gtc get att gac tgg get tct tat aca cct cct gtc cca 
Ala ABn Lys Val Ala He Asp Trp Ala Ser Tyr Thr Pro Pro Val Pro 
915 920 925 



2784 



eta aag cct ggt gta cat ata ttt gat aac ttt gat gtt tea aca ttg 2832 
Leu Lys Pro Gly Val His He Phe Asp Asn Phe Asp Val Ser Thr Leu 
930 935 940 



cgt aat tat att gat tgg ace cca ttt ttt atg acg tgg tct ctt gtt 
Arg Asn Tyr He Asp Trp Thr Pro Phe Phe Met Thr Trp Ser Leu Val 
945 950 955 960 



2880 



gga aaa tac ccg aag ate tta gag cat gaa gaa gtt ggt gaa gaa gee 2928 

Gly Lys Tyr Pro Lys He Leu Glu His Glu Glu Val Gly Glu Glu Ala 

965 970 975 

aaa cga tta ttt aaa gat gca aat gat eta tta gat cga gtt gaa aaa 2976 
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Lye Arg Leu Phe Lys Asp Ala Asn Asp Leu Leu Asp Arg Val Glu Lvs 
980 985 * 990 

gaa ggg tta ctt aaa gcc cgt gga atg tgt gcg eta ttt cca get tec 
Glu Gly Leu Leu Lys Ala Arg Gly Met Cys Ala Leu Phe Pro Ala Ser 
99S 1000 loos 



3024 



11 ! at 9at att gaa 9ta tat act 9 at 9aa tea cgc act aca 3072 

Ser Val Gly Asp Asp lie Glu Val Tyr Thr Asp Glu Ser Arg Thr Thr 
1010 1015 1020 

gtt gca aaa gta ctt cat aat ttg cga caa caa acg gag aag ccg aaa 3120 
Val Ala Lys Val Leu His Asn Leu Arg Gin Gin Thr Glu Lyl Pro Lys 
1025 1030 loss j* 40 

r? fc ^ Sat l at t9t " a tCt 9at tat ata 9 ca ccc aaa 9*9 teg ggt 3168 
Gly Phe Asn Tyr Cys Leu Ser Asp Tyr lie Ala Pro Lys Glu Ser Gly 

10 « 1050 loss 

aaa aat gat tgg ate ggt ggt ttt get gta act ggt ggt att ggt gag 3216 
Lys Asn Asp Trp lie Gly Gly Phe Ala Val Thr Gly Gly lie Gly Glu 
106 0 1065 1070 

cgt gaa eta get gat gaa tat aaa- gca aat ggt gat gat tat aac get 3264 
Arg Glu Leu Ala Asp Glu Tyr Lys Ala Asn Gly Asp Asp Tyr Asn Ala 
1075 1080 1085 

ate atg att caa gcg gtg get gat cgt eta get gaa get ttt get gaa 3312 
lie Met He Gin Ala Val Ala Asp Arg Leu Ala Slu Ala Phe Ala 111 
1090 1095 iioo 

tat tta cat gaa aaa gta cgt aag gaa att tgg ggt tac tct cct aat 3360 
Tyr Leu His Glu Lys Val Arg Lys Glu He Trp Gly Tyr Ser Pro Asn 
1105 1110 HIS 1120 

2 39 "f 9 w C " aat 93t 93t tta 3tC C9t 9 aa aaa tac caa 99C att 3408 

Glu Thr Leu Ser Asn Asp Asp Leu He Arg Glu Lys Tyr Gin Gly He 

H25 mo 1135 

cgt cct get cct ggt tac cca get tgt cct gaa cat aca gaa aaa ggg 3455 
Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Glu His Thr Glu Lys Glv 
H*0 1145 iiso 

get tta tgg gag tta atg aat gtt gaa gaa tct att gga atg tct tta 3504 
Ala Leu Trp Glu Leu Met Asn Val Glu Glu Ser He Gly Met Ser Leu 
1155 H60 U65 

aca tea age tat gca atg tgg ccc ggt gca tct gtg tea gga atg tat 3552 
Thr Ser Ser Tyr Ala Met Trp Pro Gly Ala Ser Val Ser Gly Met Tyr 
1170 1175 iiao 

ttt tea cac cca gat tct cgt tat ttt gcg att get cag att cag caa 3600 
Phe Ser His Pro Asp Ser Arg Tyr Phe Ala He Ala Gin He Gin Gin 
1185 1»0 1195 1200 

lit f?° o 90 t3t 9CC 93t ° 9t 333 99t t99 aat at 9 «t gaa 3648 
Asp Gin Ala Glu Ser Tyr Ala Asp Arg Lys Gly Trp Asn Met Leu Glu 

1205 1210 12 i5 

get gag aag tgg tta ggt cca aat ttg aat taa 36 ai 
Ala Glu Lys Trp Leu Gly Pro Asn Leu Asn 
1220 1225 
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<210> 46 
<211> 1226 
<212> PRT 

<213> Vibrio fisheri 
<400> 46 

Val Ala Gly Ser Asn He Lys Val Gin He Glu Lys Gin Leu Ser Glu 
1 5 10 15 

Arg He Leu Leu He Asp Gly Gly Met Gly Thr Met He Gin Gly Tyr 
20 25 30 

Lys Phe Glu Glu Lys Asp Tyr Arg Gly Gly Arg Phe Asn Gin Trp His 
35 40 45 

Cys Asp Leu Lys Gly Asn Asn Asp Leu Leu Val Leu Ser Gin Pro Gin 
50 55 60 

He He Arg Asp He His Glu Ala Tyr Leu Glu Ala Gly Ala Asp He 
€S 70 75 80 

Leu Glu Thr Asn Thr Phe Asn Ala Thr Thr He Ala Met Ala Asp Tyr 
85 90 95 

Asp Met Glu Ser Leu Ser Glu Glu He Asn Phe Glu Ala Ala Lys Leu 
100 105 no, 

Ala Arg Glu Val Ala Asp LyB Trp Thr Glu Lys Thr Pro Asn Lys Pro 
115 120 125 

Arg Tyr Val Ala Gly Val Leu Gly Pro Thr Asn Arg Thr Cys Ser He 
130 135 140 

Ser Pro Asp Val Asn Asp Pro Gly Phe Arg Asn Val Ser Phe Asp Glu 
150 155 160 

Leu Val Glu Ala Tyr Ser Glu Ser Thr Arg Ala Leu He Arg Gly Gly 
165 170 175 

Ser Asp Leu He Leu He Glu Thr He Phe Asp Thr Leu Asn Ala Lys 
180 185 190 

Ala Cys Ser Phe Ala Val Glu Ser Val Phe Glu Glu Leu Gly He Thr 
195 200 205 

Leu Pro Val Met He Ser Gly Thr He Thr Asp Ala Ser Gly Arg Thr 
210 215 220 

Leu Ser Gly Gin Thr Thr Glu Ala Phe Tyr Asn Ala Leu Arg His Val 
225 230 235 240 

LyB Pro He Ser Phe Gly Leu Asn Cys Ala Leu Gly Pro Asp Glu Leu 
2 *5 250 255 

Arg Glu Tyr Val Ser Glu Leu Ser Arg He Ser Glu Cys Tyr Val Ser 
260 265 270 

Ala His Pro Asn Ala Gly Leu Pro Asn Ala Phe Gly Glu Tyr Asp Leu 
275 280 285 
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Ser Pro Glu Asp Met Ala Glu His Val Ala Glu Trp Ala Ser Ser Gly 
290 295 300 

Phe Leu Asn Leu He Gly Gly Cys Cy B Gly Thr Thr Pro Glu His He 
3 <>5 310 315 320 

Arg Gin Met Ala Leu Val Val Glu Gly Val Lys Pro Arg Gin Leu Pro 
325 330 335 

Glu Leu Pro Val Ala Cys Arg Leu Ser Gly Leu Glu Pro Leu Thr He 
340 345 350 

Glu Lys Asp Ser Leu Phe He Asn Val Gly Glu Arg Thr Asn Val Thr 
355 360 365 

Gly Ser Ala Arg Phe Lys Arg Leu He Lys Glu Glu Leu Tyr Asp Glu 
370 375 380 

Ala Leu Ser Val Ala Gin Glu Gin Val Glu Asn Gly Ala Gin He He 
385 390 395 400 

Asp He Asn Met Asp Glu Gly Met Leu Asp Ala Glu Ala Cys Met Val 
405 410 415 

Arg Phe Leu Asn Leu Cys Ala Ser Glu Pro Glu He Ser Lys Val Pro 
420 425 430 

val Met Val Asp Ser Ser Lys Trp Glu Val He Glu Ala Gly Leu Lys 
435 440 445 

Cys He Gin Gly Lys Gly He Val Asn Ser He Ser Leu Lye Glu Gly 
450 455 460 

Lys Glu Lys Phe Val His Gin Ala Lys Leu He Arg Arg Tyr Gly Ala 
465 470 475 480 

Ala Val He Val Met Ala Phe Asp Glu Val Gly Gin Ala Asp Thr Arg 
485 490 495 

Glu Arg Lys He Glu He Cys Thr Asn Ala Tyr Asn He Leu Val Asp 
500 505 510 

Glu Val Gly Phe Pro Pro Glu Asp He He Phe Asp Pro Asn He Phe 
515 520 525 

Ala Val Ala Thr Gly He Asp Glu His Asn Asn Tyr Ala Val Asp Phe 
530 535 540 

He Glu Ala Val Gly Asp lie Lys Arg Thr Leu Pro His Ala Met lie 
545 550 555 560 

Ser Gly Gly Val Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Asn Tyr 
565 570 575 

Val Arg Glu Ala lie His Ala Val Phe Leu Tyr His Cys Phe Lys Asn 
580 585 590 

Gly Met Asp Met Gly lie Val Asn Ala Gly Gin Leu Glu lie Tyr Asp 
595 600 605 

Asn Val Pro Glu Asp Leu Arg Glu Ala Val Glu Asp Val Val Leu Asn 
610 615 620 
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Arg Arg Aep Asp Ser Thr Glu Arg Leu Leu Asp He Ala Thr Glu Tyr 
625 630 635 640 

Leu* Glu Arg Ala Val Gly Lys Val Glu Asp Lys Ser Ala Leu Glu Trp 
645 650 655 

Arg Asp Trp Pro Val Glu Lys Arg Leu Glu His Ser Leu Val Lys Gly 
660 665 670 

He Thr Glu Phe He Val Glu Asp Thr Glu Glu Ala Arg lie Asn Ala 
675 680 685 

Glu Arg Pro He Glu Val He Glu Gly Pro Leu Met Asp Gly Met Asn 
690 695 700 

Val Val Gly Asp Leu Phe Gly Glu Gly Lys Met Phe Leu Pro Gin Val 
70S 710 715 720 

Val Lys Ser Ala Arg Val Met Lys Gin Ala Val Ala His Leu Glu Pro 
725 730 735 

Phe He Asn Ala Ser Lys Glu Val Gly Ala Thr Asn Gly Lys He Leu 
740 745 750 

Leu Ala Thr Val Lys Gly Asp Val His Asp He Gly Lys Asn He Val 
755 760 765 

Gly Val Val Leu Gin Cys Asn Asn Tyr Glu He He Asp Leu Gly Val 
770 775 780 

Met Val Ser Cys Glu Thr He Leu Lys Val Ala Lys Glu Glu Asn Val 
785 790 795 800 

Asp He He Gly Leu Ser Gly Leu He Thr Pro Ser Leu Asp Glu Met 
805 810 815 

Val His Val Ala Lys Glu Met Glu Arg Gin Gly Phe Asp Leu Pro Leu 
820 825 830 

Leu He Gly Gly Ala Thr Thr Ser Lys Ala His Thr Ala Val Lys He 
835 840 845 

Glu Gin Asn Tyr Ser Gin Pro Val Val Tyr Val Asn Asn Ala Ser Arg 
850 855 860 

Ala Val Gly Val Cys Thr Ser Leu Leu Ser Asn Glu Leu Lys Pro Ser 
865 870 875 880 

Phe Val Glu Lys Leu Asp He Asp Tyr Glu Arg Val Arg Glu Gin His 
885 890 " 895 

Ser Arg Lys Gin Pro Arg Thr Lys Pro Val Thr Leu Glu Val Ala Arg 
900 905 910 

Ala Asn Lys Val Ala He Asp Trp Ala Ser Tyr Thr Pro Pro Val Pro 
915 920 925 

Leu Lys Pro Gly Val His He Phe Asp Asn Phe Asp Val Ser Thr Leu 
930 935 940 

Arg Asn Tyr lie Asp Trp Thr Pro Phe Phe Met Thr Trp Ser Leu Val 



WO 03/087386 PCT/EP03/04010 

207 

945 950 955 960 

Gly Lye Tyr Pro Lys He Leu Glu His Glu Glu Val Gly Glu Glu Ala 
965 970 975 

Lys Arg Leu Phe Lys Asp Ala Asn Asp Leu Leu Asp Arg Val Glu Lys 
980 985 990 

Glu Gly Leu Leu Lys Ala Arg Gly Met Cya Ala Leu Phe Pro Ala Ser 
995 1000 1005 

Ser Val Gly Asp Asp He Glu Val Tyr Thr Asp Glu Ser Arg Thr Thr 
1010 1015 1020 

Val Ala Lys Val Leu His Asn Leu Arg Gin Gin Thr Glu Lys Pro LyB 
102 * 1030 1035 * 1040 

Gly Phe Asn Tyr Cys Leu Ser Asp Tyr He Ala Pro Lys Glu Ser Gly 
1045 1050 1055 

Lys Asn Asp Trp He Gly Gly Phe Ala Val Thr Gly Gly He Gly Glu 
1060 1065 1070 

Arg Glu Leu Ala Asp Glu Tyr LyB Ala Asn Gly Asp Asp Tyr ABn Ala 
1075 1080 1085 

He Met He Gin Ala Val Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu 
1090 1095 lioo 

Tyr Leu His Glu Lys Val Arg Lys Glu He Trp Gly Tyr Ser Pro ABn 
110 5 1110 ins 1120 

Glu Thr Leu Ser Asn Asp Asp Leu He Arg Glu Lys Tyr Gin Gly lie 
1125 H30 - H35 

Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Glu His Thr Glu Lys Gly 
H40 H45 H50 

Ala Leu Trp Glu Leu Met Asn Val Glu Glu Ser lie Gly Met Ser Leu 
H55 H60 H65 

Thr Ser Ser Tyr Ala Met Trp Pro Gly Ala Ser Val Ser Gly Met Tyr 
1170 H75 1180 

Phe Ser His Pro Asp Ser Arg Tyr Phe Ala He Ala Gin lie Gin Gin 
H85 1190 1195 1200 

Asp Gin Ala Glu Ser Tyr Ala Asp Arg Lys Gly Trp Asn Met Leu Glu 
1205 1210 1215 

Ala Glu Lys Trp Leu Gly Pro Asn Leu Asn 
1220 1225 



<210> 47 
<211> 3780 
<212> DNA 

<213> Agrobacterium tumefaciens 

<220> 

<221> CDS 

<222> (1) . . (3777) 
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<223> 158B7359 
<400> 47 

gtg ccc gtg ttt gac gac ctg ttt ggc cct gaa ggg gca aag* cgc gac 46 
Val Pro Val Phe Asp Asp Leu Phe Gly Pro Glu Gly Ala Lys Arg Asp 
1 5 10 " is 

ggc gcg gaa att ttc aag gcg ttg cgc gat gcc gcc age gaa cgc ate 96 
Gly Ala Glu He Phe Lys Ala Leu Arg Asp Ala Ala Ser Glu Arg He 
20 25 30 

etc att etc gat ggt gcc atg ggc acg cag ate cag ggt etc ggt ttt 144 
Leu lie Leu Asp Gly Ala Met Gly Thr Gin He Gin Gly Leu Gly Phe 
35 40 45 



gac gag gat cat ttt cgt ggc gac cgt ttt ate ggc tgc gcc tgt cac 
Asp Glu Asp His Phe Arg Gly Asp Arg Phe He Gly Cys Ala Cys His 
50 55 60 



gcc ggt gcc ate ggt ccg acc aac cgc acg gcc teg ate teg cct gac 
Ala Gly Ala He Gly Pro Thr Asn Arg Thr Ala Ser He Ser Pro Asp 
145 150 155 160 



atg ate tea ggc acg ate acc gac ctt tec ggt cgc acg ttg tec ggc 
Met He Ser Gly Thr He Thr Asp Leu Ser Gly Arg Thr Leu Ser Gly 
225 230 235 240 



192 



cag aag ggc aat aac gac ctt ctg ate ctg aca cag ccc gat gcc ate 240 
Gin Lys Gly Asn Asn Asp Leu Leu He Leu Thr Gin Pro Asp Ala He 
6 5 70 75 80 

gag gaa ate cac tat cgc tac gcc atg gcg ggc gcg gat att etc gaa 288 
Glu Glu He His Tyr Arg Tyr Ala Met Ala Gly Ala Asp He Leu Glu 
85 90 95 

acc aac acg ttt tec tec acc cgc ate gcg cag gcc gat tac gag atg 336 
Thr Asn Thr Phe Ser Ser Thr Arg He Ala Gin Ala Asp Tyr Glu Met 
100 105 no 

gag aat gcc gtc tac gat etc aac cgc gag ggc gcg gcg ate gtg cgc 384 
Glu Asn Ala Val Tyr Asp Leu Asn Arg Glu Gly Ala Ala He Val Arg 
115 120 125 

egg gcg get cag cgc gcc gag cgc gag gat ggc cgc cgc cgt ttc gtg 432 
Arg Ala Ala Gin Arg Ala Glu Arg Glu Asp Gly Arg Arg Arg Phe Val 
130 135 140 



480 



gtc aac aat ccc ggt tac cgc gcc gtc agt ttc gac gat ctg cgc att 528 
Val Asn Asn Pro Gly Tyr Arg Ala Val Ser Phe Asp Asp Leu Arg He 
165 170 175 

gcc tat ggc gag cag ate gat ggc ctg ate gac ggt ggt gcc gat ate 576 
Ala Tyr Gly Glu Gin He Asp Gly Leu He Asp Gly Gly Ala Asp He 
180 185 190 

ate etc ate gag acg ate ttc gat acg ctg aac gcc aag gcg gcg ate 624 
He Leu lie Glu Thr lie Phe Asp Thr Leu Asn Ala Lys Ala Ala He 
195 200 205 

ttc gcc tgc gag gaa cgt ttc gag get aag ggc ate cgc ctg ccg gtc 672 
Phe Ala Cys Glu Glu Arg Phe Glu Ala Lys Gly He Arg Leu Pro Val 
210 215 220 



720 
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cag acg cct teg gcg ttc tgg aac teg gtg cgc cac gec aac ccc ttc 768 
Gin Thr Pro Ser Ala Phe Trp Asn Ser Val Arg His Ala Asn Pro Phe 
245 250 255 

acc ate ggc etc aac tgc gcg etc ggt gcg gat gee atg cgc ccg cat 816 
Thr He Gly Leu Asn Cys Ala Leu Gly Ala Asp Ala Met Arg Pro His 
260 265 270 



ctg cag gaa ctg tec gat gtg gee gac acc ttt gtc tgc gec tat ccg 
Leu Gin Glu Leu Ser Asp Val Ala Asp Thr Phe Val Cys Ala Tyr Pro 
275 280 285 



864 



aat gee ggc ctg ccg aac gag ttc ggc caa tat gac gaa acg ccc gag 912 
Asn Ala Gly Leu Pro Asn Glu Phe Gly Gin Tyr Asp Glu Thr Pro Glu 
290 295 300 

atg atg gcg cgc cag gtt gag ggc ttc gtt cgt gac ggt etc gtc aac 960 
Met Met Ala Arg Gin Val Glu Gly Phe Val Arg Asp Gly Leu Val Asn 
305 310 315 320 

ate gtc ggc ggt tgc tgc ggt teg acg ccg gaa cat ate egg gcg att 1008 
He Val Gly Gly Cys Cys Gly Ser Thr Pro Glu His He Arg Ala He 
325 330 335 

gee gaa gee gtc aag gat tac aag ccc cgc gaa att cct gaa cac aag 1056 
Ala Glu Ala Val Lys Asp Tyr Lys Pro Arg Glu He Pro Glu His Lys 
340 345 350 

ccg ttc atg teg ctt tec ggc ctt gaa ccc ttc gtg ctg acc aag gac 1104 
Pro Phe Met Ser Leu Ser Gly Leu Glu Pro Phe Val Leu Thr Lys Asp 
355 360 365 

att ccc ttc gtc aac gtg ggc gag cgc acc aac gtc acc ggt teg gec 1152 
He Pro Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser Ala 
370 375 380 

cgc ttc cgc aag etc ate act gee ggc gac tat acg gcg gcg ctg get 1200 
Arg Phe Arg Lys Leu He Thr Ala Gly Asp Tyr Thr Ala Ala Leu Ala 
385 390 395 400 

gtt gec cgc gac cag gtg gaa aac ggc gcg cag ate ate gac ate aac 124 8 
Val Ala Arg Asp Gin Val Glu Asn Gly Ala Gin He He Asp He Asn 
405 410 415 

atg gat gag ggc ctg ate gat teg gaa aag gcg atg gtc gag ttc ctg 1296 
Met Asp Glu Gly Leu He Asp Ser Glu Lys Ala Met Val Glu Phe Leu 
420 425 430 

aac etc ate gec gec gag cct gac att gec cgt gtg ccc gtc atg ate 1344 
Asn Leu He Ala Ala Glu Pro Asp He Ala Arg Val Pro Val Met He 
435 440 ~ 445 

gac tea tec aag ttc gag ate ate gag gee ggc ctg aaa tgc gtg cag 1392 
Asp Ser Ser Lys Phe Glu He He Glu Ala Gly Leu Lys Cys Val Gin 
450 455 460 

ggc aaa teg ate gtc aat tec att teg ctg aag gaa ggc gag gag aag 144 0 
Gly Lys Ser He Val Asn Ser He Ser Leu Lys Glu Gly Glu Glu LyB 
465 470 475 480 

ttt etc cag cag get egg etc gtc cac aat tac ggt gcg gcg gtt gtc 14 88 
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Phe Leu Gin Gin Ala Arg Leu Val His Asn Tyr Gly Ala Ala Val Val 
485 490 " 495 

gtc atg gcc ttt gat gag gtc ggg cag gcg gat acc tat cag, cgc aag 1536 
Val Met Ala Phe Asp Glu Val Gly Gin Ala Asp Thr Tyr Gin Arg Lys 
500 505 510 

gtg gaa ate tgc gcg cgc gcc tac aag ctt ctg acc gaa aag gcc ggt 1584 
Val Glu He Cys Ala Arg Ala Tyr Lys Leu Leu Thr Glu Lys Ala Gly 
515 520 525 

ctg tct ccg gaa gac ate ate ttc gac ccg aat gtg ttt gcg gta get 1632 
Leu Ser Pro Glu Asp He He Phe Asp Pro Asn Val Phe Ala Val Ala 
530 535 540 

acg ggc ate gag gag cac aat aat tac ggc gtg gac ttc ate gag gcc 1680 
Thr Gly He Glu Glu His Asn Asn Tyr Gly Val Asp Phe He Glu Ala 
545 550 555 560 

acc aag acc ate cgc gaa acc atg ccg etc acg cat att tec ggg ggc 1728 
Thr Lys Thr He Arg Glu Thr Met Pro Leu Thr His He Ser Gly Gly 
565 570 575 

gtt tec aac ctg tec ttc tec ttc cgc ggc aat gag ccg gtg cgt gag 1776 
Val Ser Asn Leu Ser Phe Ser Phe Arg Gly Asn Glu Pro Val Arg Glu 
580 585 590 

gcg atg cat gcc gtg ttc etc tat cac gcc att cag gtc ggc atg gat 1824 
Ala Met His Ala Val Phe Leu Tyr His Ala He Gin Val Gly Met Asp 
595 600 605 

atg ggc ate gtc aac gcc ggg cag ctt gcg gtt tac gac aat ate gat 1872 
Met Gly He Val Asn Ala Gly Gin Leu Ala Val Tyr Asp Asn He Asp 
610 615 620 

gcg gaa ctg cgc gag gcc tgc gaa gac gtg gtg ctg aac cgc cgc gac 1920 
Ala Glu Leu Arg Glu Ala Cys Glu Asp Val Val Leu Asn Arg Arg Asp 
625 630 635 640 

gat gcc acg gag cgt ctg etc gag gtg gcg gag cgt ttc cgt ggt acg 1968 
Asp Ala Thr Glu Arg Leu Leu Glu Val Ala Glu Arg Phe Arg Gly Thr 
645 650 ° 655 

ggt gaa aaa cag gcc aag gtg cag gat ctt tec tgg cgc gag tat ccc 2016 
Gly Glu Lys Gin Ala Lys Val Gin Asp Leu Ser Trp Arg Glu Tyr Pro 
660 665 670 

gtt gaa aag egg ctg gaa cat get ctg gtc aac ggc att acc gac tat 2064 
Val Glu Lys Arg Leu Glu His Ala Leu Val Asn Gly He Thr Asp Tyr 
675 680 665 

ate gag gcc gat acg gaa gag gca cgc cag cag gcc gcc cgc ccg ctg 2112 
He Glu Ala Asp Thr Glu Glu Ala Arg Gin Gin Ala Ala Arg Pro Leu 
690 695 700 

cat gtc ate gaa ggg ccg ctg atg gcc ggt atg aat gtg gtg ggt gac 2160 
His Val He Glu Gly Pro Leu Met Ala Gly Met Asn Val Val Gly Asp 
70S 710 715 720 

ctg ttc ggt tec ggc aag atg ttc ctg cca cag gtg gtg aaa tec gcc 2208 
Leu Phe Gly Ser Gly Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala 
725 730 735 
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cgt gtg atg aag cag gcg gtt gcc gtt ctg ctg cct tac atg gaa gag 2256 
Arg val Met Lys Gin Ala Val Ala Val Leu Leu Pro Tyr Met Glu Glu 
740 745 750 

gaa aag cgc ctg aat ggc ggt tec gag cgc agt gcc gcc ggc aag gtg 2304 
Glu Lys Arg Leu Asn Gly Gly Ser Glu Arg Ser Ala Ala Gly Lys Val 
755 760 765 

eta atg gcg acc gtg aag ggc gac gtg cac gat ate ggc aag aac ate 2352 
Leu Met Ala Thr Val Lys Gly Asp Val His Asp He Gly Lys Asn He 
770 775 780 



gtc ggc gtt gtg eta gcc tgc aac aat tac gag ate att gat etc ggc 
Val Gly Val Val Leu Ala Cys Asn Asn Tyr Glu He He Asp Leu Gly 
785 790 795 



800 



2400 



gtg atg gtg ccg acg acg aaa ate etc gaa acg gcg ate gcc gaa aag 2448 
Val Met Val Pro Thr Thr Lys He Leu Glu Thr Ala He Ala Glu Lys 
805 810 815 

gtg gat gtg ate ggc etc tec ggc etc ate acc ccg teg ctg gat gag 2496 
Val Asp Val He Gly Leu Ser Gly Leu He Thr Pro Ser Leu Asp Glu 
820 825 830 

atg gtg cat gtg gcg gcc gaa atg gag cga cag ggt ttc gac att ccg 2544 
Met val His Val Ala Ala Glu Met Glu Arg Gin Gly Phe Asp He Pro 
835 840 845 

ctg ctg ate ggc ggt gcg acg acc age cgt gtg cat acg gcg gta aaa 2592 
Leu Leu He Gly Gly Ala Thr Thr Ser Arg Val His Thr Ala Val Lys 
850 855 860 

ate cat ccg cgt tac gag cag ggg cag gcg ate tat gtc acc gac gcc 2640 
He His Pro Arg Tyr Glu Gin Gly Gin Ala He Tyr Val Thr Asp Ala 
865 870 875 860 

teg cgc gcg gtg ggc gtc gtt tea gcg etc etc tec gaa gag cag aag 2688 
Ser Arg Ala Val Gly Val Val Ser Ala Leu Leu Ser Glu Glu Gin Lys 
885 890 895 

ccc get tat ate gac ggc ate cga gcc gaa tat gcc aag gtg gcg gaa 2736 
Pro Ala Tyr He Asp Gly He Arg Ala Glu Tyr Ala Lys Val Ala Glu 
900 905 910 

gcc cat gcc cgc aat gag cgc gaa aag cag cgc ctg ccg ctt tec cgc 2784 
Ala His Ala Arg Asn Glu Arg Glu Lys Gin Arg Leu Pro Leu Ser Arg 
915 920 ** 925 

gcc egg gag aat gcg cac aag ate gac tgg teg age tac age gtt gtc 2832 
Ala Arg Glu Asn Ala His Lys He Asp Trp Ser Ser Tyr Ser Val Val 
930 935 940 

aag ccg cag ttc ttc ggc acc aag gtt ttt gag acc tat gat ctg gaa 2B80 
Lys Pro Gin Phe Phe Gly Thr Lys Val Phe Glu Thr Tyr Asp Leu Glu 
9« 950 955 960 

gag ctt tec cgt tac ate gac tgg acg ccg ttc ttc cag acc tgg gaa 2928 
Glu Leu Ser Arg Tyr lie Asp Trp Thr Pro Phe Phe Gin Thr Trp Glu 
965 970 975 

ttg aag ggc cgt ttc ccg gcg ate ctt gaa gac gaa aag cag ggc gag 2976 
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Leu Lys Gly Arg Phe Pro Ala He Leu Glu Asp Glu Lys Gin Gly Glu 
980 985 990 



gcg gcg egg cag ctt tat gec gat gcg cag gec atg ctt gcg aag ate 
Ala Ala Arg Gin Leu Tyr Ala Asp Ala Gin Ala Met Leu Ala Lys He 
995 1000 1005 



3024 



ate gag gaa aag tgg ttc cga cca cgc gcg gtg ate ggc ttc tgg ccg 
He Glu Glu Lys Trp Phe Arg Pro Arg Ala Val He Gly Phe Trp Pro 
1010 1015 1020 



3072 



gee aat gee gtg ggt gac gat ate agg etc ttt acg gat gaa ggt egg 
Ala Asn Ala Val Gly Asp Asp lie Arg Leu Phe Thr Asp Glu Gly Arg 
1025 1030 1035 1040 



3120 



aag gaa gag ttg gcg acg ttc ttc acg ctg cgc cag cag ctt tec aag 
Lys Glu Glu Leu Ala Thr Phe Phe Thr Leu Arg Gin Gin Leu Ser Lys 
1045 1050 1055 



3168 



cgc gat ggc cgt ccg aac gtg gcg ctg tec gat ttc gtc gcg ccc gtc 
Arg Asp Gly Arg Pro Asn Val Ala Leu Ser Asp Phe Val Ala Pro Val 
1060 1065 1070 



3216 



_ gat _agc ggc.gtt gec gat- tat gtc ggc ggt ttc gtg gta acg gcg ggt 
Asp Ser Gly Val Ala Asp Tyr Val Gly Gly Phe Val Val Thr Ala Gly 
1075 1080 1085 



3264 



ate gag gaa gtg gcg att gee gag cgc ttc gag egg gee aat gac gat 
He Glu Glu Val Ala He Ala Glu Arg Phe Glu Arg Ala Asn Asp Asp 
1090 1095 1100 



3312 



tat teg tec ate etc gtc aag gcg ttg get gac cgt ttt gee gaa gec 
Tyr Ser Ser He Leu Val Lys Ala Leu Ala Asp Arg Phe Ala Glu Ala 
H05 1110 His 1120 



3360 



ttt gec gag cgt atg cat gag cgc gtg cgc aag gag ttc tgg ggt tat 
Phe Ala Glu Arg Met His Glu Arg Val Arg Lys Glu Phe Trp Gly Tyr 
1125 1130 1135 



3408 



gcg ccg gac gag get ctt gee ggt gac gat ctg ata ggc gaa gee tat 
Ala Pro Asp Glu Ala Leu Ala Gly Asp Abp Leu He Gly Glu Ala Tyr 
1140 1145 1150 



3456 



gec ggt ate cgc ccg gca ccg ggt tat ccg gee cag ccg gac cac ace 
Ala Gly He Arg Pro Ala Pro Gly Tyr Pro Ala Gin Pro Asp His Thr 
1155 1160 1165 



3504 



gaa aag aag acg ctg ttt get ctg ctg gac gee ace aat gcg gcg ggt 
Glu Lys Lys Thr Leu Phe Ala Leu Leu Asp Ala Thr Asn Ala Ala Gly 
1170 1175 H80 



3552 



gtg gaa ttg acg gaa age tat gcg atg tgg ccc ggc teg teg gtt teg 
val Glu Leu Thr Glu Ser Tyr Ala Met Trp Pro Gly Ser Ser Val Ser 
1185 1190 1195 1200 



3600 



ggc etc tat ate ggc cat ccc gaa age tat tat ttc ggc gtt gec aag 3648 
Gly Leu Tyr He Gly His Pro Glu Ser Tyr Tyr Phe Gly Val Ala Lys 
1205 1210 1215 

gtg gag egg gat cag gtt etc gac tat gcg cgc cgc aag gat atg ccg 3696 
val Glu Arg Asp Gin Val Leu Asp Tyr Ala Arg Arg Lys Asp Met Pro 
1220 1225 1230 
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gtc aca gag gtg gag cgc tgg etc ggg ccg gtg etc aac tac gtg ccg 3744 
Val Thr Glu Val Glu Arg Tip Leu Gly Pro Val Leu Asn Tyr Val Pro 
1235 1240 1245 

acc aac ggc gag gag aaa ate gac age get gcg tga 3780 
Thr Asn Gly Glu Glu Lys He Asp Ser Ala Ala 
1250 1255 



<210> 48 
<211> 1259 
<212> PRT 

<213> Agrobacterium tumefaciens 
<400> 48 

Val Pro Val Phe Asp Asp Leu Phe Gly Pro Glu Gly Ala Lys Arg Asp 
15 10 15 

Gly Ala Glu He Phe Lys Ala Leu Arg Asp Ala Ala Ser Glu Arg He 
20 25 30 

Leu He Leu Asp Gly Ala Met Gly Thr Gin He Gin Gly Leu Gly Phe 
35 40 45 

Asp Glu Asp His Phe Arg Gly Asp Arg Phe He Gly Cys Ala Cys His 
50 55 60 

Gin Lys Gly Asn Asn Asp Leu Leu He Leu Thr Gin Pro Asp Ala He 
65 70 75 80 

Glu Glu He His Tyr Arg Tyr Ala Met Ala Gly Ala Asp He Leu Glu 
85 90 95 

Thr Asn Thr Phe Ser Ser Thr Arg He Ala Gin Ala Asp Tyr Glu Met 
100 105 110 

Glu Asn Ala Val Tyr Asp Leu Asn Arg Glu Gly Ala Ala He Val Arg 
115 120 125 

Arg Ala Ala Gin Arg Ala Glu Arg Glu Asp Gly Arg Arg Arg Phe Val 
130 135 140 

Ala Gly Ala He Gly Pro Thr Asn Arg Thr Ala Ser He Ser Pro Asp 
145 150 155 160 

Val Asn Asn Pro Gly Tyr Arg Ala Val Ser Phe Asp Asp Leu Arg He 
165 170 175 

Ala Tyr Gly Glu Gin He Asp Gly Leu He Asp Gly Gly Ala Asp He 
180 185 190 

lie Leu He Glu Thr He Phe Asp Thr Leu Asn Ala Lys Ala Ala He 
195 200 205 

Phe Ala Cys Glu Glu Arg Phe Glu Ala Lys Gly He Arg Leu Pro Val 
210 215 220 

Met He Ser Gly Thr He Thr Asp Leu Ser Gly Arg Thr Leu Ser Gly 
225 230 235 240 

Gin Thr Pro Ser Ala Phe Trp Asn Ser Val Arg His Ala Asn Pro Phe 



f 



WO 03/087386 



PCT/EP03/04010 



214 



245 



250 



255 



Thr lie Gly Leu Asn Cys Ala Leu Gly Ala Asp Ala Met Arg Pro His 
260 265 270 

Leu Gin Glu Leu Ser Asp Val Ala Asp Thr Phe Val Cys Ala Tyr Pro 
275 280 285 

Asn Ala Gly Leu Pro Asn Glu Phe Gly Gin Tyr Asp Glu Thr Pro Glu 
290 295 300 

Met Met Ala Arg Gin Val Glu Gly Phe Val Arg Asp Gly Leu Val Asn 
305 310 315 320 

He Val Gly Gly Cys Cys Gly Ser Thr Pro Glu His He Arg Ala He 
325 330 335 

Ala Glu Ala Val Lys Asp Tyr Lys Pro Arg Glu He Pro Glu His Lys 
340 345 350 

Pro Phe Met Ser Leu Ser Gly Leu Glu Pro Phe Val Leu Thr Lys Asp 
355 360 365 

He Pro Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser Ala 
370 375 380 

Arg Phe Arg Lys Leu He Thr Ala Gly Asp Tyr Thr Ala Ala Leu Ala 
385 390 395 400 

Val Ala Arg Asp Gin Val Glu Asn Gly Ala Gin He He Asp He Asn 
405 410 415 

Met Asp Glu Gly Leu He Asp Ser Glu Lys Ala Met Val Glu Phe Leu 
420 425 430 

Asn Leu He Ala Ala Glu Pro Asp He Ala Arg Val Pro Val Met He 
435 440 445 

Asp Ser Ser Lys Phe Glu He He Glu Ala Gly Leu Lys Cys Val Gin 
450 455 460 

Gly Lys Ser He Val Asn Ser lie Ser Leu Lys Glu Gly Glu Glu Lys 
465 470 475 480 

Phe Leu Gin Gin Ala Arg Leu Val His Asn Tyr Gly Ala Ala Val Val 
485 490 495 

Val Met Ala Phe Asp Glu Val Gly Gin Ala Asp Thr Tyr Gin Arg Lys 
500 505 510 

Val Glu He Cys Ala Arg Ala Tyr LyB Leu Leu Thr Glu Lys Ala Gly 
515 520 525 

Leu Ser Pro Glu Asp He He Phe Asp Pro Asn Val Phe Ala Val Ala 
530 535 540 

Thr Gly He Glu Glu His Asn Asn Tyr Gly Val Asp Phe He Glu Ala 
545 550 555 560 

Thr Lys Thr He Arg Glu Thr Met Pro Leu Thr His He Ser Gly Gly 



565 



570 



575 
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Val ser Asn Leu Ser Phe Ser Phe Arg Gly Asn Glu Pro Val Arg olu 
580 585 590 

Ala Met Hie Ala Val Phe Leu Tyr His Ala lie Gin Val Gly Net Asp 
595 600 605 

Met Gly lie Val Asn Ala Gly Gin Leu Ala Val Tyr Asp Asn He Asp 
610 615 620 

Ala Glu Leu Arg Glu Ala Cys Glu Asp Val Val Leu Asn Arg Arg Asp 
625 630 635 640 

Asp Ala Thr Glu Arg Leu Leu Glu Val Ala Glu Arg Phe Arg Gly Thr 
645 650 655 

Gly Glu Lys Gin Ala Lys Val Gin Asp Leu Ser Trp Arg Glu Tyr Pro 
660 665 670 

Val Glu Lys Arg Leu Glu His Ala Leu Val Asn Gly He Thr Asp Tyr 
675 680 685 

He Glu Ala Asp Thr Glu Glu Ala Arg Gin Gin Ala Ala Arg Pro Leu 
690 695 700 

His Val He Glu Gly Pro Leu Met Ala Gly Met Asn Val Val Gly Asp 
705 710 715 720 

Leu Phe Gly Ser Gly Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala 
725 730 735 

Arg Val Met Lys Gin Ala Val Ala Val Leu Leu Pro Tyr Met Glu Glu 
740 745 750 

Glu Lys Arg Leu Asn Gly Gly Ser Glu Arg Ser Ala Ala Gly Lys Val 
755 760 765 

Leu Met Ala Thr Val Lys Gly Asp Val His Asp He Gly Lys Asn He 
770 775 780 

Val Gly Val Val Leu Ala Cys Asn Asn Tyr Glu He He Asp Leu Gly 
785 790 795 800 

Val Met Val Pro Thr Thr Lys He Leu Glu Thr Ala He Ala Glu Lys 
805 810 815 

Val Asp Val He Gly Leu Ser Gly Leu He Thr Pro Ser Leu Abp Glu 
820 825 830 

Met Val His Val Ala Ala Glu Met Glu Arg Gin Gly Phe Asp He Pro 
835 840 845 

Leu Leu He Gly Gly Ala Thr Thr Ser Arg Val His Thr Ala Val Lys 
850 855 860 

He His Pro Arg Tyr Glu Gin Gly Gin Ala He Tyr Val Thr Asp Ala 
865 870 875 * 880 

Ser Arg Ala Val Gly Val Val Ser Ala Leu Leu Ser Glu Glu Gin Lys 
885 890 895 

Pro Ala Tyr He Asp Gly He Arg Ala Glu Tyr Ala Lys Val Ala Glu 
900 905 910 
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Ala His Ala Arg Asn Glu Arg Glu Lye Gin Arg Leu Pro Leu Ser Arg 
915 920 ^ 925 

Ala Arg Glu Asn Ala His Lys He Asp Trp Ser Ser Tyr Ser Val Val 
930 935 940 

Lys Pro Gin Phe Phe Gly Thr Lys Val Phe Glu Thr Tyr Asp Leu Glu 
345 950 955 960 

Glu Leu Ser Arg Tyr He Asp Trp Thr Pro Phe Phe Gin Thr Trp Glu 
965 970 975 

Leu Lys Gly Arg Phe Pro Ala He Leu Glu Asp Glu Lys Gin Gly Glu 
980 985 990 

Ala Ala Arg Gin Leu Tyr Ala Asp Ala Gin Ala Met Leu Ala Lys He 
995 1000 1005 

He Glu Glu Lys Trp Phe Arg Pro Arg Ala Val He Gly Phe Trp Pro 
1010 1015 1020 

Ala Asn Ala Val Gly Asp Asp He Arg Leu Phe Thr Asp Glu Gly Arg 
1025 1030 1035 1040 

Lys Glu Glu Leu Ala Thr Phe Phe Thr Leu Arg Gin Gin Leu Ser Lys 
1045 1050 " 1055 

Arg Asp Gly Arg Pro ABn Val Ala Leu Ser Asp Phe Val Ala Pro Val 
1060 1065 1070 

Asp Ser Gly Val Ala Asp Tyr Val Gly Gly Phe Val Val Thr Ala Gly 
1075 1080 1085 

He Glu Glu Val Ala He Ala Glu Arg Phe Glu Arg Ala Asn Asp Asp 
1090 1095 1100 

Tyr Ser Ser He Leu Val Lys Ala Leu Ala Asp Arg Phe Ala Glu Ala 
H05 1110 U15 H20 

Phe Ala Glu Arg Met His Glu Arg Val Arg Lys Glu Phe Trp Gly Tyr 
1125 1130 1135 

Ala Pro Asp Glu Ala Leu Ala Gly Asp Asp Leu He Gly Glu Ala Tyr 
1140 1145 1150 

Ala Gly He Arg Pro Ala Pro Gly Tyr Pro Ala Gin Pro Asp His Thr 
1155 1160 H65 

Glu Lys Lys Thr Leu Phe Ala Leu Leu Asp Ala Thr Asn Ala Ala Gly 
1170 1175 1180 

Val Glu Leu Thr Glu Ser Tyr Ala Met Trp Pro Gly Ser Ser Val Ser 
1185 1190 1195 1200 

Gly Leu Tyr He Gly His Pro Glu Ser Tyr Tyr Phe Gly Val Ala Lys 
1205 1210 1215 

Val Glu Arg Asp Gin Val Leu Asp Tyr Ala Arg Arg Lys Asp Met Pro 
1220 1225 1230 

Val Thr Glu Val Glu Arg Trp Leu Gly Pro Val Leu Asn Tyr Val Pro 
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1235 1240 1245 

Thr Asn Gly Glu Glu Lys lie Asp Ser Ala Ala 
1250 1255 



<210> 49 








<211> 2718 








<212> DNA 








<213> Ralstonia solanacearum 








<220> 








<221> CDS 








<222> (1) . . (2715) 








<223> RSOLJ3MI1000 








<400> 49 








atg acc gac cac etc atg cgc etc tec ggc 


etc gaa ccg 


ttc 


aac 


Met Thr Asp His Leu Met Arg Leu Ser Gly 


Leu Glu Pro 


Phe 


Asn 


1 5 10 






15 



48 



ggc gag gac acg ctg ttc gtc aac gtc ggc gaa cgc acc aac gtc acc 96 

Gly Glu Asp Thr Leu Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr 
20 25 30 

gga tec aag gcg ttc gcg cgc atg ate etc aac age cag ttc gac gag 144 

Gly Ser Lys Ala Phe Ala Arg Met lie Leu Asn Ser Gin Phe Asp Glu 
35 40 45 

gcg etc gec gtg gca cgc cag cag gtc gag aac ggc gcg cag gtc ate 192 
Ala Leu Ala Val Ala Arg Gin Gin Val Glu Asn Gly Ala Gin Val lie 
50 55 60 

gac ate aac atg gac gag gee atg etc gac tec aag gcg gcg atg gtg 240 

Asp lie Asn Met Asp Glu Ala Met Leu Asp Ser Lys Ala Ala Met Val 
65 70 75 80 

cgc ttc ctg aac ctg ate gee teg gag ccg gac ate gcg cgc gtg ccg 288 

Arg Phe Leu Asn Leu lie Ala Ser Glu Pro Asp He Ala Arg Val Pro 
85 90 95 

ate atg ate gac teg tec aag tgg gag gtg ate gag gee ggc ctg aag 336 
He Met He Asp Ser Ser Lys Trp Glu Val He Glu Ala Gly Leu Lys 
100 105 110 

tgc gtg cag ggc aag gee ate gtc aac teg ate teg etc aag gaa ggc 384 

Cys Val Gin Gly Lys Ala He Val Asn Ser lie Ser Leu Lys Glu Gly 
115 120 125 

gag gaa cag ttc gee cac cac gec aag ctg ate aag cgc tac ggc gee 432 

Glu Glu Gin Phe Ala His His Ala Lys Leu lie Lys Arg Tyr Gly Ala 
130 135 140 

gec gee gtg gtg atg gee ttc gac gag cag ggc cag gee gac acg ttc 480 

Ala Ala Val Val Met Ala Phe Asp Glu Gin Gly Gin Ala Asp Thr Phe 
145 150 155 160 

gcg cgc aag acc gag ate tgc aag cgc age tat gac ttc etc gtg aac 528 

Ala Arg Lys Thr Glu He Cys Lys Arg Ser Tyr Asp Phe Leu Val Asn 

165 170 175 

cag gtc ggc ttt gcg ccg gaa gac ate ate ttc gat ccg aac ate ttc 576 
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Gin Val Gly Phe Ala Pro Glu Asp He He Phe Asp Pro Asn He Phe 
180 185 ~ 190 

gcg gtc gcc acc ggc ate gag gag cac aac aac tac gec gtc gac ttc 624 
Ala Val Ala Thr Gly He Glu Glu His Asn Asn Tyr Ala Val Asp Phe 
195 200 205 

ate gag gcc acg cgc tgg ate aag cag aaa ttg ccg cac gcc aag gtg 672 
He Glu Ala Thr Arg Trp He Lys Gin Lys Leu Pro His Ala Lys Val 
210 215 220 

age ggc ggc gtg teg aac gtc teg ttc teg ttc cgc ggc aac gac gtg 720 
Ser Gly Gly Val Ser A8n Val Ser Phe Ser Phe Arg Gly Asn Asp Val 
225 230 235 240 

gtg cgc gag gcc ate cac acc gtg ttc ctg tac cac gcc ate ggt gcg 768 
Val Arg Glu Ala He His Thr Val Phe Leu Tyr His Ala He Gly Ala 
245 250 255 

ggc atg gac atg ggc ate gtc aac gcg ggc cag ttg ggc gtg tac gag 816 
Gly Met Asp Met Gly He Val Asn Ala Gly Gin Leu Gly Val Tyr Glu 
260 265 270 

aac etc gcc ccc gaa ctg cgc gag cgc gtg gaa gac gtg gtg etc aac 864 
Asn Leu Ala Pro Glu Leu Arg Glu Arg Val Glu Asp Val Val Leu Asn 
275 280 285 

cgc cgc ccg gat gcg acc gac cgc ctg ctg gaa att gcc gac cgc tac 912 
Arg Arg Pro Asp Ala Thr Asp Arg Leu Leu Glu He Ala Asp Arg Tyr 
290 295 300 

aag ggc ggc ggc gcc aag cgc gag gag aac etc gcc tgg cgc cag gag 960 
Lys Gly Gly Gly Ala Lys Arg Glu Glu Asn Leu Ala Trp Arg Gin Glu 
305 310 315 320 

ccg gtg gaa aag cgc ctg gcc cac gcg etc gtg cac ggc ate acc gac 1008 
Pro Val Glu Lys Arg Leu Ala His Ala Leu Val His Gly He Thr Asp 
325 330 335 

tac gtg gtc gaa gac acc gag gaa gtt cgc cag aag ate ttt gcc gcc 1056 
Tyr Val Val Glu Asp Thr Glu Glu Val Arg Gin Lys He Phe Ala Ala 
340 345 350 

ggc ggc cgc ccg ate cag gtg ate gag ggc ccg ctg atg gac ggc atg 1104 
Gly Gly Arg Pro He Gin Val He Glu Gly Pro Leu Met Asp Gly Met 
355 360 365 

aac ate gtc ggc gat ctg ttc ggc gcg ggc aag atg ttc ctg ccg cag 1152 
Asn He Val Gly Asp Leu Phe Gly Ala Gly Lys Met Phe Leu Pro Gin 
370 375 380 

gtg gtg aaa tec gcc cgc gtg atg aag cag gcg gtg gcc cac ctg ate 1200 
Val Val Lys Ser Ala Arg Val Met Lys Gin Ala Val Ala His Leu He 
385 390 395 400 

ccg ttc ate gag gaa gag aag egg cag ate gcg gcc gcc ggc ggc gac 1248 
Pro Phe He Glu Glu Glu Lys Arg Gin He Ala Ala Ala Gly Gly Asp 
405 410 415 

gtg cgc teg cgc ggc aag ate gtc ate gcc acc gtg aag ggc gac gtg 1296 
Val Arg Ser Arg Gly Lys He Val He Ala Thr Val Lys Gly Asp Val 
420 425 430 
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cac gac ate ggc aag aac ate gtc ace gtc gtg etc cag tgc aac aac 1344 
His Asp lie Gly LyB Asn He Val Thr Val Val Leu Gin Cys Asn Asn 
435 440 445 

ttc gaa gtc gtg aac atg ggc gtg atg gtc ccg tgc aac gag ate ctg 1392 
Phe Glu Val Val Asn Met Gly Val Met Val Pro Cys Asn Glu He Leu 
450 455 460 

gee aag gcg aag gtc gag ggc gcg gac ate ate ggc ctg teg ggc ctg 1440 
Ala Lys Ala Lys Val Glu Gly Ala Asp He He Gly Leu Ser Gly Leu 
465 470 475 480 

ate aca ccg teg ctg gaa gag atg gee tac gtg gee tec gag atg cag 1488 
He Thr Pro Ser Leu Glu Glu Met Ala Tyr Val Ala Ser Glu Met Gin 
485 490 495 

cgc gac gag tac ttc cgc gtg aag aag ate ccg ctg ctg ate ggt ggc 1536 
Arg Asp Glu Tyr Phe Arg Val Lys Lys He Pro Leu Leu He Gly Gly 
500 505 510 

gcg ace acg age cgc gtg cac ace gee gtg aag ate gcg ccc aat tac 1584 
Ala Thr Thr Ser Arg Val His Thr Ala Val Lys He Ala Pro Asn Tyr 
515 520 525 

gaa ggc ccg gtc gtg tac gtg ccc gac gee teg cgc teg gtg age gtg 1632 
Glu Gly Pro Val Val Tyr Val Pro Asp Ala Ser Arg Ser Val Ser Val 
530 535 540 

gee tec age ctg ctg tec gac gag gee gec gcg cgc tac ate gaa gag 1680 
Ala Ser Ser Leu Leu Ser Asp Glu Ala Ala Ala Arg Tyr He Glu Glu 
545 550 555 560 

ctg cac gee gac tac gac cgc ate cgc ace cag cac gec age aag aaa 1728 
Leu His Ala Asp Tyr Asp Arg He Arg Thr Gin His Ala Ser Lys Lys 
565 570 575 

gec atg ccg atg gtg teg ctg gee gee gcg cgc gee aac aag ace egg 1776 
Ala Met Pro Met Val Ser Leu Ala Ala Ala Arg Ala Asn Lys Thr Arg 
580 585 590 

ate gac tgg teg aac tac acg ccg ccc aag ccc aag ttc gtc ggc cgc 1824 
He Asp Trp Ser Asn Tyr Thr Pro Pro Lys Pro Lys Phe Val Gly Arg 
595 600 605 

cgc gtg ttc cgc aac tac gac ctg aac gag etc gcg cag tac ate gac 1872 
Arg Val Phe Arg Asn Tyr Asp Leu Asn Glu Leu Ala Gin Tyr He Asp 
610 615 620 

tgg ggc ccg ttc ttc cag acg tgg gac ctg gee ggc aaa ttc ccc gac 1920 
Trp Gly Pro Phe Phe Gin Thr Trp Asp Leu Ala Gly Lys Phe Pro Asp 
625 630 635 640 

ate etc aac gac gcg ate gtc ggc gaa teg gec cgc cgc gtg ttc tec 1968 
He Leu Asn Asp Ala He Val Gly Glu Ser Ala Arg Arg Val Phe Ser 
645 650 " 655 

gac ggc aag age atg etc gcg cgc ctg ate gec gga cgc tgg ctg acg 2016 
Asp Gly Lys Ser Met Leu Ala Arg Leu He Ala Gly Arg Trp Leu Thr 
660 665 670 

gec aac ggc gtg ate gcg ctg ctg ccg gee aac ace gtc aac gac gac 2064 
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Ala Asn Gly Val lie Ala Leu Leu Pro Ala Asn Thr Val Asn Asp Asp 
675 680 685 

gac ate gag ate tac ace gac gag ace cgc teg gaa gtc gee etc acc 2112 
Asp He Glu He Tyr Thr Asp Glu Thr Arg Ser Glu Val Ala Leu Thr 
690 695 700 

tgg cgc aac ate cgc cag cag age gag cgc ccg ate ate gac ggc gtg 2160 
Trp Arg Asn lie Arg Gin Gin Ser Glu Arg Pro He He Asp Gly Val 
705 710 715 720 

atg cgc ccg aac cgc tgc ctg gcg gac ttc ate gec ccc aag gac acc 2208 
Met Arg Pro Asn Arg Cys Leu Ala Asp Phe He Ala Pro Lys Asp Thr 
725 730 735 

ggc ate gec gat tac ate ggc etc ttc gcg gtg acg ggc ggc ate ggg 2256 
Gly He Ala Asp Tyr He Gly Leu Phe Ala Val Thr Gly Gly He Gly 
740 745 750 

ate gac aag cgc gaa gec gec ttc gaa gec gac cac gac gac tac age 2304 
He Asp Lys Arg Glu Ala Ala Phe Glu Ala Asp His Asp Asp Tyr Ser 
755 760 765 

gcg ate atg etc aag gec ctg gee gac cgc ttc gec gaa gee ttc gec 2352 
Ala He Met Leu Lys Ala Leu Ala Asp Arg Phe Ala Glu Ala Phe Ala 
770 775 780 

gag tgc ctg cac gec cgt gtg cgc cgc gac ctg tgg ggc tac gcg cag 2400 
Glu Cys Leu His Ala Arg Val Arg Arg Asp Leu Trp Gly Tyr Ala Gin 
785 790 795 800 

gac gaa acg etc gac aac gac gcg ctg ate cgc gag gaa tac cgc ggc 2448 
Asp Glu Thr Leu Asp Asn Asp Ala Leu He Arg Glu Glu Tyr Arg Gly 
805 810 815 

ate cgc ccg gcg ccc ggc tac ccg gec tgc ccg gag cac acc gtc aag 2496 
He Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Glu His Thr Val Lye 
820 825 830 

cgc gac ctg ttc cgc gtg etc gac gcg cag gag ate ggc atg aac ctg 2544 
Arg Asp Leu Phe Arg Val Leu Asp Ala Gin Glu He Gly Met Asn Leu 
835 840 845 

acc gag gcg ctg gcg atg aca ccg gec gcg teg gtc teg ggc ttc cag 2592 
Thr Glu Ala Leu Ala Met Thr Pro Ala Ala Ser Val Ser Gly Phe Gin 
850 855 860 

ctg teg cac ccg gac age acg tac ttc acg ate ggc aag ate ggc cag 2640 
Leu Ser His Pro Asp Ser Thr Tyr Phe Thr He Gly Lys He Gly Gin 
865 870 875 880 

gac cag gtg gac gac atg gee gcg cgc age ggg gaa gac cgc cgc aat 2688 
Asp Gin Val Asp Asp Met Ala Ala Arg Ser Gly Glu Asp Arg Arg Asn 
885 890 B95 

gtg gag cgc gee ctg gca ccc aac ctg taa 2718 
Val Glu Arg Ala Leu Ala Pro Asn Leu 
900 905 



<210> 50 
<211> 90S 
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<212> PRT 

<213> Ralstonia solanacearum 
<400> 50 

Met Thr Asp His Leu Met Arg Leu Ser Gly Leu Glu Pro Phe Asn He 
1 5 10 15 

Gly Glu Asp Thr Leu Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr 
20 25 30 

Gly Ser Lys Ala Phe Ala Arg Met He Leu Asn Ser Gin Phe Asp Glu 
35 40 45 

Ala Leu Ala Val Ala Arg Gin Gin Val Glu Asn Gly Ala Gin Val He 
50 55 60 

Asp He Asn Met Asp Glu Ala Met Leu Asp Ser Lys Ala Ala Met Val 
65 70 75 80 

Arg Phe Leu Asn Leu He Ala Ser Glu Pro Asp He Ala Arg Val Pro 
85 90 95 

He Met He Asp Ser Ser Lys Trp Glu Val He Glu Ala Gly Leu Lys 
100 105 110 

Cys Val Gin Gly Lys Ala He Val Asn Ser He Ser Leu Lys Glu Gly 
115 120 125 

Glu Glu Gin Phe Ala His His Ala Lys Leu He Lys Arg Tyr Gly Ala 
130 135 140 

Ala Ala Val Val Met Ala Phe Asp Glu Gin Gly Gin Ala Asp Thr Phe 
145 150 155 160 

Ala Arg Lys Thr Glu He Cys Lys Arg Ser Tyr Asp Phe Leu Val Asn 
165 170 175 

Gin Val Gly Phe Ala Pro Glu Asp He He Phe Asp Pro Asn He Phe 
180 185 190 

Ala Val Ala Thr Gly He Glu Glu His Asn Asn Tyr Ala Val Asp Phe 
195 200 205 

He Glu Ala Thr Arg Trp He Lys Gin Lys Leu Pro His Ala Lys Val 
210 215 220 

Ser Gly Gly Val Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Asp Val 
225 230 235 240 

Val Arg Glu Ala He His Thr Val Phe Leu Tyr His Ala He Gly Ala 
245 250 255 

Gly Met Asp Met Gly He Val Asn Ala Gly Gin Leu Gly Val Tyr Glu 
260 265 270 

Asn Leu Ala Pro Glu Leu Arg Glu Arg Val Glu Abp Val Val Leu Asn 
275 280 285 

Arg Arg Pro Asp Ala Thr Asp Arg Leu Leu Glu He Ala Asp Arg Tyr 
290 295 300 

LyB Gly Gly Gly Ala Lys Arg Glu Glu Aen Leu Ala Trp Arg Gin Glu 
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305 



310 



315 



320 



Pro Val Qlu Lye Arg Leu Ala His Ala Leu Val His Gly He Thr Asp 
325 330 335 

Tyr Val Val Glu Asp Thr Glu Glu Val Arg Gin Lys He Phe Ala Ala 
340 345 350 

Gly Gly Arg Pro He Gin Val He Glu Gly Pro Leu Met Asp Gly Met 
355 360 365 

Asn He Val Gly Asp Leu Phe Gly Ala Gly Lys Met Phe Leu Pro Gin 
370 375 380 

Val Val Lys Ser Ala Arg Val Met Lys Gin Ala Val Ala His Leu He 
3 *5 390 395 400 

Pro Phe He Glu Glu Glu Lys Arg Gin He Ala Ala Ala Gly Gly Asp 
405 410 415 

Val Arg Ser Arg Gly Lys He Val He Ala Thr Val Lys Gly Asp Val 
420 425 * 430 

His Asp He Gly Lys Asn He Val Thr Val Val Leu Gin Cys Asn Asn 
435 440 445 

Phe Glu Val Val Asn Met Gly Val Met Val Pro Cys Asn Glu He Leu 
450 455 460 

Ala Lys Ala Lys Val Glu Gly Ala Asp lie He Gly Leu Ser Gly Leu 
465 470 475 480 

He Thr Pro Ser Leu Glu Glu Met Ala Tyr Val Ala Ser Glu Met Gin 
485 490 495 

Arg Asp Glu Tyr Phe Arg Val Lys Lys He Pro Leu Leu He Gly Gly 
500 505 510 

Ala Thr Thr Ser Arg Val His Thr Ala Val Lys He Ala Pro Asn Tyr 
515 520 525 

Glu Gly Pro Val Val Tyr Val Pro Asp Ala Ser Arg Ser Val Ser Val 
530 535 540 

Ala Ser Ser Leu Leu Ser Asp Glu Ala Ala Ala Arg Tyr He Glu Glu 
545 550 555 560 

Leu His Ala Asp Tyr Asp Arg He Arg Thr Gin Hie Ala Ser Lys Lys 
565 570 575 

Ala Met Pro Met Val Ser Leu Ala Ala Ala Arg Ala Asn Lys Thr Arg 
580 585 590 

He Asp Trp Ser Asn Tyr Thr Pro Pro Lys Pro Lys Phe Val Gly Arg 
595 600 605 

Arg Val Phe Arg Asn Tyr Asp Leu Asn Glu Leu Ala Gin Tyr He Asp 
610 615 620 

Trp Gly Pro Phe Phe Gin Thr Trp Asp Leu Ala Gly Lys Phe Pro Asp 



625 



630 



635 



640 
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He Leu Asn Asp Ala He Val Gly Glu Ser Ala Arg Arg Val Phe Ser 
€45 650 " 655 

Asp, Gly Lys Ser Met Leu Ala Arg Leu He Ala Gly Arg Trp 'Leu Thr 
660 665 670 

Ala Asn Gly Val He Ala Leu Leu Pro Ala Asn Thr Val Asn Asp Asn 
675 680 685 

Asp He Glu He Tyr Thr Asp Glu Thr Arg Ser Glu Val Aid Leu Thr 
690 695 700 

Trp Arg Asn He Arg Gin Gin Ser Glu Arg Pro He He Asp Gly Val 
7 °* 710 715 720 

Met Arg Pro Asn Arg Cys Leu Ala Asp Phe lie Ala Pro Lys Asp Thr 
725 730 735 

Gly He Ala Asp Tyr He Gly Leu Phe Ala Val Thr Gly Gly He Gly 
740 745 750 

He Asp Lys Arg Glu Ala Ala Phe Glu Ala Asp His Asp Asp Tyr Ser 
755 760 765 

Ala He Met Leu Lys Ala Leu Ala Asp Arg Phe Ala Glu Ala Phe Ala 
770 775 780 

Glu Cys Leu His Ala Arg Val Arg Arg Asp Leu Trp Gly Tyr Ala Gin 
785 790 795 800 

Asp Glu Thr Leu Asp Asn Asp Ala Leu He Arg Glu Glu Tyr Arg Gly 
805 810 815 

He Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Glu His Thr Val Lys 
820 825 830 

Arg Asp Leu Phe Arg Val Leu Asp Ala Gin Glu He Gly Met Asn Leu 
835 840 845 

Thr Glu Ala Leu Ala Met Thr Pro Ala Ala Ser Val Ser Gly Phe Gin 
850 855 860 

Leu Ser His Pro Asp Ser Thr Tyr Phe Thr He Gly Lys He Gly Gin 
865 870 875 880 

Asp Gin Val Asp Asp Met Ala Ala Arg Ser Gly Glu Asp Arg Arg Asn 
885 890 895 



Val Glu Arg Ala Leu Ala Pro Asn Leu 
900 905 



<210> 51 
<211> 3645 
<212> DNA 

<213> Chlorobium tepidum 

<220> 
<221> CDS 
<222> (1) . . (3642) 
<223> RCL00420 
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<400> 51 

gtg etc gac ggg gec atg ggc acc atg ate cag agg cat ggc etc gac 48 
Val Leu Asp Gly Ala Met Gly Thr Met He Gin Arg His Gly Leu Asp 
1 5 io is 

gaa cag gac tac egg ggc gag cgt ttc get teg cat gac cat ccg ctg 96 
Glu Gin Asp Tyr Arg Gly Glu Arg Phe Ala Ser His Asp His Pro Leu 
20 25 30 

aag ggc aac aac gac ctt ctt gtc ate acc egg ccc gac ate ate cgt 144 
Lys Gly Asn Asn Asp Leu Leu Val He Thr Arg Pro Asp He He Arg 
35 40 " 45 

teg ate cac tgc gac ttc etc gac gcg ggt gcg gac ate ate gag acc 192 
Ser lie His Cys Asp Phe Leu Asp Ala Gly Ala Asp He He Glu Thr 
50 55 60 

tgc acc ttc aac gec aac ccg ate teg cag teg gac tac cag ttg cag 240 
Cys Thr Phe Asn Ala Asn Pro He Ser Gin Ser Asp Tyr Gin Leu Gin 
65 70 75 80 

gac ttg acc cgc gag ctg aac gtg gcg gcg gca aag ata gec cgc teg 288 
Asp Leu Thr Arg Glu Leu Asn Val Ala Ala Ala Lys He Ala Arg Ser 
85 90 95 

gca gcg gac gag ttc acc gca aag act ccc gac aag ccg cgt ttc gtg 336 
Ala Ala Asp Glu Phe Thr Ala Lys Thr Pro Asp Lys Pro Arg Phe Val 
100 105 110 

gee ggt tec ate gga ccg acc aac aag acg etc teg etc teg ccg gac 384 
Ala Gly Ser He Gly Pro Thr Asn Lys Thr Leu Ser Leu Ser Pro Asp 
115 120 125 

gtg aac aac ccc ggc ttc cgc gee gtc acc ttc cag gag atg gtc gat 432 
Val Asn Asn Pro Gly Phe Arg Ala Val Thr Phe Gin Glu Met Val Asp 
130 135 140 

aac tac act gee cag etc gaa ggc ttg cac gag ggc ggt gtc gat etc 480 
Asn Tyr Thr Ala Gin Leu Glu Gly Leu His Glu Gly Gly Val Asp Leu 
145 150 155 160 

ttg etc gtc gag acg gtg ttc gac aca ctg aac tgc aag gcg gcg etc 528 
Leu Leu Val Glu Thr Val Phe Asp Thr Leu Asn Cys Lys Ala Ala Leu 
165 170 175 

tac get ate gag gag tac gcg gtg aaa acc ggc tgg cag gtg ccc gtg 576 
Tyr Ala He Glu Glu Tyr Ala Val Lys Thr Gly Trp Gin Val Pro Val 
180 185 190 

atg gtc tec ggc acg gtg gtg gac gcg age ggc cgc acc etc tec ggc 624 
Met Val Ser Gly Thr Val Val Asp Ala Ser Gly Arg Thr Leu Ser Gly 
195 200 205 

caa acc acc gag gcg ttc tgg att teg att teg cac atg ccg agt ctg 672 
Gin Thr Thr Glu Ala Phe Trp He Ser He Ser His Met Pro Ser Leu 
210 215 220 

etc teg gtc ggc ctg aac tgc gca etc ggc tec aag cag atg cgc ccc 720 
Leu Ser Val Gly Leu Asn Cys Ala Leu Gly Ser Lys Gin Met Arg Pro 
225 230 235 240 

ttc ate gag gcg etc teg aac ate gee gaa age tac gtc age gtc tat 768 
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Phe He Glu Ala Leu Ser Aen He Ala Glu Ser Tyr Val Ser Val Tyr 
245 250 255 

ccc aac gcg ggc ctg ccg aat gag ttc ggc gag tac. gac gac -tec ccc 816 
Pro Asn Ala Oly Leu Pro Asn Glu Phe Gly Glu Tyr Asp Asp Ser Pro 
260 265 * 270 

gag tac atg gec gcg cag ate gcg ggc ttc gec gaa tea ggc ttc gtg 864 
Glu Tyr Met Ala Ala Gin He Ala Gly Phe Ala Glu Ser Gly Phe Val 
275 280 285 

aac ate gtc ggc ggc tgc tgc ggc ace acg ccg acg cac ate cgc gec 912 
Asn He Val Gly Gly Cys Cys Gly Thr Thr Pro Thr His He Arg Ala 
290 295 300 

att gec gaa gcg gtc aag act etc ccg ccg aga aag cgc ccc gec aac 960 
He Ala Glu Ala Val Lys Thr Leu Pro Pro Arg Lys Arg Pro Ala Asn 
305 310 315 ~ 320 

aag cac gtg ctg agg etc tec ggc etc gaa ccg etc gtg gtt gac gaa 1008 
Lys His Val Leu Arg Leu Ser Gly Leu Glu Pro Leu Val Val Asp Glu 
325 330 335 

ace ace ggc ttc ate aac gtc ggc gag cgc ace aac gtc ace ggt teg 1056 
Thr Thr Gly Phe He Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser 
340 345 350 , 

cgc aag ttc gee cgc etc ate aag gag gec aat tac gac gaa. gcg etc 1104 
Arg Lys Phe Ala Arg Leu He Lys Glu Ala Asn Tyr Asp Glu Ala Leu 
355 360 365 

tec att gee cgc cag cag gtc gag aac ggc gcg cag gtg ate gac gtg 1152 
Ser He Ala Arg Gin Gin Val Glu Asn Gly Ala Gin Val He Asp Val 
370 375 380 

aac etc gac gaa gga atg etc gac tec gaa aag gtg ate gtc gaa ttc 1200 
Asn Leu Asp Glu Gly Met Leu Asp Ser Glu Lys Val He Val Glu Phe 
385 390 395 400 

ctg aac etc ate gec tec gag cct gag ate gec aag gtg ccg gtg atg 1248 
Leu Asn Leu He Ala Ser Glu Pro Glu He Ala Lys Val Pro Val Met 
405 410 415 

ate gac teg teg aaa tgg teg gtc ate gaa aac ggc ctg cgc tgc ace 1296 
He Asp Ser Ser Lys Trp Ser Val He Glu Asn Gly Leu Arg Cys Thr 
420 425 430 

cag ggc aag age ate gtc aac teg ate age etc aag gag ggc gag gag 1344 
Gin Gly Lys Ser He Val Asn Ser He Ser Leu Lys Glu Gly Glu Glu 
435 440 445 

ctg ttc aag gag cgc get cgc aag ate atg caa tac ggc gcg gcg gcg 1392 
Leu Phe Lys Glu Arg Ala Arg Lys He Met Gin Tyr Gly Ala Ala Ala 
450 455 460 

gtg gtc atg gee ttc gac gag cag ggc cag gee gac age ctg cac cgc 1440 
Val Val Met Ala Phe Asp Glu Gin Gly Gin Ala Asp Ser Leu His Arg 
465 470 475 480 

cgc ate gag att tgc age cgc gec tac aaa att etc ace gaa gag gtg 1488 
Arg lie Glu He Cys Ser Arg Ala Tyr Lys He Leu Thr Glu Glu Val 
485 490 495 
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ggc ttc ccg ccg gag gac ate ate ttt gac ccg aac gtg ctg ace gtg 1S36 
Gly Phe Pro Pro Glu Asp He He Phe Asp Pro Aon Val Leu Thr Val 
500 505 510 

gec acc ggc ate gac gag cac aac aac tac gcg etc gac ttc ate gaa 1584 
Ala Thr Gly He Asp Glu His Asn Asn Tyr Ala Leu Asp Phe He Glu 
515 520 525 

age gtg cgc tgg ate aag cag aac ctg ccg cac gcg aag gtc tec ggc 1632 
Ser Val Arg Trp He Lys Gin Asn Leu Pro His Ala Lys Val Ser Gly 
530 535 540 

ggc ate age aac gtt teg ttc tec ttc cgc ggc aac gag ccg gtg cgc 1680 
Gly He Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Glu Pro Val Arg 
545 550 555 560 

gag gcg atg cac acc gcg ttc etc tac cac gee ate cac gee ggt etc 1728 
Glu Ala Met His Thr Ala Phe Leu Tyr His Ala He His Ala Gly Leu 
565 570 575 

gac atg ggc ate gtc aac gee gee cag ctt ggc ate tac gaa gag ate 1776 
Asp Met Gly He Val Asn Ala Ala Gin Leu Gly He Tyr Glu Glu He 
560 585 590 

gac ccg gag ctt ctt gtc tat gtc gag gac gtg ctg ctg aac cgc cgc 1824 
Asp Pro Glu Leu Leu Val Tyr Val Glu Asp Val Leu Leu Asn Arg Arg 
595 600 605 

gac gac gee acc gag egg etc gtg gcg ttc get gaa acg ate cgc gac 1872 
Asp Asp Ala Thr Glu Arg Leu Val Ala Phe Ala Glu Thr He Arg Asp 
610 615 620 

ggc ggc gaa aag gee gag gee aag aac gee gaa tgg cgc aac gee ccg 1920 
Gly Gly Glu Lys Ala Glu Ala Lys Asn Ala Glu Trp Arg Asn Ala Pro 
625 630 635 640 

gtc gag gag egg ctg aaa cac gcg etc gtc aag ggc ate gtt gac tac 1968 
Val Glu Glu Arg Leu Lys His Ala Leu Val Lys Gly He Val Asp Tyr 
645 650 655 

ate gac gag gac acc gaa gag gee cgc cag etc tac ccg agt ccg ctg 2016 
He Asp Glu Asp Thr Glu Glu Ala Arg Gin Leu Tyr Pro Ser Pro Leu 
660 665 670 

gag gtg ate gag ggg ccg etc atg aac ggc atg aac cac gtc ggc gac 2064 
Glu Val He Glu Gly Pro Leu Met Asn Gly Met Asn His Val Gly Asp 
675 680 685 

etc ttc gee gaa ggc aag atg ttc ctg cca cag gtg gtc aaa age gee 2112 
Leu Phe Ala Glu Gly Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala 
690 695 700 

cgc gtc atg aag cgc teg gta get gcg ctg att ccc tat ate gag gag 2160 
Arg Val Met Lys Arg Ser Val Ala Ala Leu He Pro Tyr He Glu Glu 
705 710 715 720 

gag aag teg aaa aac tgc gac acg age gec aaa gee aag gtg ctg etc 2208 
Glu Lys Ser Lys Asn Cys Asp Thr Ser Ala Lys Ala Lys Val Leu Leu 
725 730 735 . 



gee acg gtg aag ggc gac gtg cac gac ate ggc aag aac ate gtg teg 



2256 
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Ala Thr Val Lys Gly Asp Val His Asp He Gly Lys Asn He Val Ser 
740 745 750 

gtg. gtg ctt gcc tgc aac aac ttc gac gtg ate gac ate ggc gtc atg 2304 
Val Val Leu Ala Cys Asn Asn Phe Asp Val He Asp He Gly Val Met 
755 760 765 

atg cca tgc gac aag att etc gaa gcg ctg gca gaa cac aag ccc gac 2352 
Met Pro Cys Asp Lys He Leu Glu Ala Leu Ala Glu His Lys Pro Asp 
770 775 780 



gtg etc ggc etc tec ggc etc ate acc ccg teg etc gaa gag atg gcg 
Val Leu Gly Leu Ser Gly Leu He Thr Pro Ser Leu Glu Glu Met Ala 
785 790 795 600 



age gtg ccg gtg gtc age aac etc tgc aac ccc gcc cag cgc 'gac age 
Ser Val Pro Val Val Ser Asn Leu Cys Asn Pro Ala Gin Arg Asp Ser 
850 855 860 



acc aaa etc ttc aac gac gcc acc get ctg etc gac egg ate gac age 
Thr Lys Leu Phe Asn Asp Ala Thr Ala Leu Leu Asp Arg He Asp Ser 
965 970 975 



2400 



cac gtg gcc aaa gag atg gag egg etc ggc atg aac att ccg etc ate 2446 
His Val Ala Lys Glu Met Glu Arg Leu Gly Met Asn He Pro Leu He 
605 810 815 

ate ggc ggc gcg acc acc teg aag gtg cac acg gcg gtg aaa etc gcg 24 96 
He Gly Gly Ala Thr Thr Ser Lys Val His Thr Ala Val Lys Leu Ala 
820 825 830 

ccc tgc tac ccc age ggc gcg gta gta cac gtg etc gac gcc teg cgc 2544 
Pro Cys Tyr Pro Ser Gly Ala Val Val His Val Leu Asp Ala Ser Arg 
635 840 845 



2592 



tat ate gcg gcg ctg aag gat gag cag gag gcg atg cgc aag age cac 2640 
Tyr He Ala Ala Leu Lys Asp Glu Gin Glu Ala Met Arg Lys Ser His 
870 875 * 880 

gcc gag cgc atg gcg gca aaa aag tac gtc teg etc gac gcc gcc cgc 2688 
Ala Glu Arg Met Ala Ala Lys Lys Tyr Val Ser Leu Asp Ala Ala Arg 
685 890 895 

gac aac cgc etc acc att gac tgg gag gcc gaa acc ate gac aag ccc 2736 
Asp Asn Arg Leu Thr He Asp Trp Glu Ala Glu Thr He Asp Lys Pro 
900 905 910 

gcc cag act ggc gtc acc gtg ctg gag gat gtc acc gtc ggc gcg etc 2784 
Ala Gin Thr Gly Val Thr Val Leu Glu Asp Val Thr Val Gly Ala Leu 
915 920 925 

cgc ccg tat ate gac tgg gca mcc ttc ttc tgg age tgg gag ctg cac 2832 
Arg Pro Tyr He Asp Trp Ala Xaa Phe Phe Trp Ser Trp Glu Leu His 
930 935 940 

ggc gtc tat ccg cag att ctg gag gat gaa aag gtc ggc gag gag gca 2880 
Gly Val Tyr Pro Gin He Leu Glu Asp Glu LyB Val Gly Glu Glu Ala 
945 950 955 960 



2928 



gaa aag ctg etc ggc ate aaa ggc gtg gcg ggc ate ttc ccg gcc aac 2976 
Glu Lys Leu Leu Gly He Lys Gly Val Ala Gly lie Phe Pro Ala Asn 
980 985 990 



WO 03/087386 



PCT/EP03/04010 



228 

age ate ggc gac gac ate ttc gtc tat gcg gat gac gag cqc tea at* 3Ma 
ser He Gly Asp Asp lie Phe Val Tyr III Lp S! Arg S lit ^ 
995 1000 loos 

ti C t 9 ° if" 9t ? Ct9 C3C acc ct 9 cgc ca 9 caa 33C gaa aag cac ggc 3072 
lie Arg Thr Val Leu His Thr Leu Arg Gin Gin Sly Slu Lyf £s 111 

1015 1020 

III 1,1 r CtC 9 f 9 Ct9 909 93C ttC 9tg 9" cc 9 gaa age ggc 3120 
Glu Ala Asn Leu Ala Leu Ala Asp Phe Val Ala Pro Arg Glu sir S?y 

1030 1035 1040 

gtc aac gac tgg ate ggc tgc ttc acc gta acc gee gga etc ggc ate 3i«n 
Val Asn Asp Trp He Gly Cys Phe Thr Val Thr Ala Sly Hi ?ly tit 
1045 10S0 105 5 

cag aat ttg etc gac gag ttc aca gca gag aac gac gac tac cac cgc 3216 
Gin Asn Leu^Leu Asp Glu Phe Thr^Ala Glu Asn Lp Lp Tyr His 

ate atg aca cag gcg etc gee gac cga ctg gee gaa gcg ttc oca «a 
lie Met Thr Gin Ala Leu Ala Asp Arg Le? La Lu IS p£e IS SI 
1075 10B0 ioes 

m!? r t9 £? C Hf 3 389 9tg cgc C9C 9aa ctc 99C tac gcg ccc ggc 3312 

Snoo 8 G1U LyS Val *** Glu Leu **P «y Tyr III Pro 111 
X09 ° 1095 1100 

gaa ate etc ggc aac gaa gag ctg ate gee gaa aag tac cga ggc ate 3360 
Glune Leu Gly Asn Glu^Glu Leu lie Ala Glu_Ly B Tyr Arg Gly ill 

a™ III !?f » CC 2?° I a ° CCC 9CC t9c cc 9 9 a <= cac acc gaa aag gca 3408 
Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Asp His Thr Glu Lys La 

112 5 1130 i 13s 

HI Hi lt° ! a ° f t9 aaC 9Ct 9aa 9C 9 9cc acc ggc gtc aeg ctg 3456 
He He Phe Asp Leu Leu Asn Ala Glu Ala Ala Thr Gly Val Thr Leu 
1140 H45 uso 

aeg gaa act ttc gcg atg aac ccc gca gee tea gtc tgc ggc ctc tac 350* 
Thr Glu Thr Phe Ala Met Asn Pro Ala Ala Ser Val Cys Sly III t£ 
1155 1160 lies 

ttc gee aac ccg gee teg aaa tac ttc gta ctc ggc aag att ggt aag 3552 
Phe Ala Asn Pro Ala Ser Lys Tyr Phe Val Leu Sy Lys lie Sy lyl " 
1170 H'5 iieo 

gat cag gtc gaa gac tac gee aac cgc aaa ggg ctg gaa gta gca aaa 3i;oo 
Asp Gin Val Glu Asp Tyr Ala Asn Arg Lys Sly LeJ lit Val E K 36 °° 
1185 1190 1195 1200 

gee gag aag tgg etc gcg ccc teg ctg aac tac gat cca gcg 3642 
Ala Glu Lys Trp Leu Ala Pro Ser Leu Asn Tyr Asp Pro All 
1205 i 210 

taa 

3645 



<210> 52 
<211> 1214 
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<212> PRT 

<213> Chlorobium tepidum 
<220> 

<221> unsure 
<222> 936 936 

<223> All occurrences of Xaa indicate any amino acid 
<400> 52 

Val Leu Asp Gly Ala Met Gly Thr Met He Gin Arg His Gly Leu Asp 
1 5 10 is 

Glu Gin Asp Tyr Arg Gly Glu Arg Phe Ala Ser His Asp His Pro Leu 
20 25 30 

Lys Gly Asn Asn Asp Leu Leu Val He Thr Arg Pro Asp He He Arg 
35 40 45 

Ser He His Cys Asp Phe Leu Asp Ala Gly Ala Asp He He Glu Thr 
50 55 60 

Cys Thr Phe Asn Ala Asn Pro He Ser Gin Ser Asp Tyr Gin Leu Gin 
€5 70 75 80 

Asp Leu Thr Arg Glu Leu Asn Val Ala Ala Ala Lys lie Ala Arg Ser 
65 90 95 

Ala Ala Asp Glu Phe Thr Ala Lys Thr Pro Asp Lys Pro Arg Phe Val 
100 105 " * no 

Ala Gly Ser He Gly Pro Thr Asn Lys Thr Leu Ser Leu Ser Pro Asp 
115 120 125 

Val Asn Asn Pro Gly Phe Arg Ala Val Thr Phe Gin Glu Met Val Asp 
130 135 140 

Asn Tyr Thr Ala Gin Leu Glu Gly Leu His Glu Gly Gly Val Asp Leu 
145 150 155 160 

Leu Leu Val Glu Thr Val Phe Asp Thr Leu Asn Cys Lys Ala Ala Leu 
165 170 175 

Tyr Ala He Glu Glu Tyr Ala Val Lys Thr Gly Trp Gin Val Pro Val 
180 185 190 

Met Val Ser Gly Thr Val Val Asp Ala Ser Gly Arg Thr Leu Ser Gly 
195 200 205 

Gin Thr Thr Glu Ala Phe Trp He Ser He Ser His Met Pro Ser Leu 
210 215 220 

Leu Ser Val Gly Leu Asn Cys Ala Leu Gly Ser Lys Gin Met Arg Pro 
225 230 235 240 

Phe He Glu Ala Leu Ser Asn He Ala Glu Ser Tyr Val Ser Val Tyr 
245 250 255 

Pro Asn Ala Gly Leu Pro Asn Glu Phe Gly Glu Tyr Asp Asp Ser Pro 
260 265 270 

Glu Tyr Met Ala Ala Gin He Ala Gly Phe Ala Glu Ser Gly Phe Val 
275 280 285 
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Asn lie Val Gly Gly Cys Cys Gly Thr Thr Pro Thr His He Arg Ala 
290 295 300 

He Ala Glu Ala Val Lys Thr Leu Pro Pro Arg Lye Arg Pro Ala Aan 
305 310 315 320 

Lys His Val Leu Arg Leu Ser Gly Leu Glu Pro Leu Val Val Asp Glu 
325 330 335 

Thr Thr Gly Phe He Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser 
340 34S 350 

Arg Lys Phe Ala Arg Leu He Lys Glu Ala Asn Tyr Asp Glu Ala Leu 
355 360 365 

Ser He Ala Arg Gin Gin Val Glu Asn Gly Ala Gin Val He Asp Val 
370 375 380 

Asn Leu Asp Glu Gly Met Leu Asp Ser Glu Lys Val He Val Glu Phe 
385 390 395 400 

Leu Asn Leu He Ala Ser Glu Pro Glu He Ala Lys Val Pro Val Net 
405 410 415 

He Asp Ser Ser Lys Trp Ser Val He Glu Asn Gly Leu Arg Cys Thr 
420 425 430 

Gin Gly Lys Ser He Val Asn Ser He Ser Leu Lys Glu Gly Glu Glu 
435 440 445 

Leu Phe Lys Glu Arg Ala Arg Lys He Met Gin Tyr Gly Ala Ala Ala 
450 455 460 

Val Val Met Ala Phe Asp Glu Gin Gly Gin Ala Asp Ser Leu His Arg 
465 470 475 480 

Arg He Glu He Cys Ser Arg Ala Tyr Lys He Leu Thr Glu Glu Val 
485 490 495 

Gly Phe Pro Pro Glu Asp He He Phe Asp Pro Asn Val Leu Thr Val 
500 505 510 

Ala Thr Gly He Asp Glu His Asn Asn Tyr Ala Leu Asp Phe He Glu 
515 520 525 

Ser Val Arg Trp He Lys Gin Asn Leu Pro His Ala Lys Val Ser Gly 
530 535 540 

Gly He Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Glu Pro Val Arg 
545 550 555 560 

Glu Ala Met His Thr Ala Phe Leu Tyr His Ala He His Ala Gly Leu 
565 570 575 

Asp Met Gly He Val Asn Ala Ala Gin Leu Gly He Tyr Glu Glu He 
580 585 590 

Asp Pro Glu Leu Leu Val Tyr Val Glu Asp Val Leu Leu Asn Arg Arg 
595 600 605 

Asp Asp Ala Thr Glu Arg Leu Val Ala Phe Ala Glu Thr He Arg Asp 
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610 



615 



620 



Gly Gly Glu LyB Ala Glu Ala Lye Asn Ala Glu Trp Arg Asn Ala Pro 
"5 630 6 35 ■ , 640 

Val Glu Glu Arg Leu Lys His Ala Leu Val Lys Gly He Val Asp Tyr 
645 650 655 

He Asp Glu Asp Thr Glu Glu Ala Arg Gin Leu Tyr Pro Ser Pro Leu 
660 665 67-0 

Glu Val He Glu Gly Pro Leu Met Asn Gly Met Asn His Val Gly Asp 
675 680 685 

Leu Phe Ala Glu Gly Lys Met Phe Leu Pro Gin Val Val Lys Ser Ala 
690 695 700 

Arg Val Met Lys Arg Ser Val Ala Ala Leu He Pro Tyr He Glu Glu 
705 710 715 * 720 

Glu Lys Ser Lys Asn CyB Asp Thr Ser Ala Lys Ala Lys Val Leu Leu 
725 730 735 

Ala Thr Val Lys Gly Asp Val His Asp He Gly Lys Asn He Val Ser 
740 745 750 

Val Val Leu Ala Cys Asn Asn Phe Asp Val He Asp He Gly Val Met 
755 760 765 

Met Pro Cys Asp LyB He Leu Glu Ala Leu Ala Glu His Lys Pro Asp 
770 775 780 

Val Leu Gly Leu Ser Gly Leu He Thr Pro Ser Leu Glu Glu Met Ala 
785 790 795 800 

His Val Ala Lys Glu Met Glu Arg Leu Gly Met Asn He Pro Leu He 
805 810 815 

lie Gly Gly Ala Thr Thr Ser Lys Val His Thr Ala Val Lys Leu Ala 
820 825 830 

Pro Cys Tyr Pro Ser Gly Ala Val Val His Val Leu Asp Ala Ser Arg 
835 840 845 

Ser Val Pro Val Val Ser Asn Leu Cys Asn Pro Ala Gin Arg Asp Ser 
850 855 860 

Tyr He Ala Ala Leu Lys Asp Glu Gin Glu Ala Met Arg Lys Ser His 
865 870 875 ~ 880 

Ala Glu Arg Met Ala Ala Lys Lys Tyr Val Ser Leu Asp Ala Ala Arg 
885 890 895 

Asp Asn Arg Leu Thr He Asp Trp Glu Ala Glu Thr lie Asp Lys Pro 
900 905 910 

Ala Gin Thr Gly Val Thr Val Leu Glu Asp Val Thr Val Gly Ala Leu 
915 920 925 

Arg Pro Tyr He Asp Trp Ala Xaa Phe Phe Trp Ser Trp Glu Leu His 



930 



935 



940 
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Gly Val Tyr Pro Gin He Leu Glu Asp Glu Lys Val Gly Glu Glu Ala 
945 950 955 960 

Thr Lys Leu Phe Aen Asp Ala Thr Ala Leu Leu Asp Arg He Asp Ser 
965 970 " 975 

Glu Lys Leu Leu Gly He Lys Gly Val Ala Gly He Phe Pro Ala Asn 
980 985 990 

Ser He Gly Asp Asp He Phe Val Tyr Ala Asp Asp Glu Arg Ser lie 
995 1000 1005 

He Arg Thr Val Leu His Thr Leu Arg Gin Gin Gly Glu Lys His Gly 
1010 1015 1020 

Glu Ala Asn Leu Ala Leu Ala Asp Phe Val Ala Pro Arg Glu Ser Gly 
*025 1030 1035 "* 1040 

Val Asn Asp Trp He Gly Cys Phe Thr Val Thr Ala Gly Leu Gly He 
1045 1050 1055 

Gin Asn Leu Leu Asp Glu Phe Thr Ala Glu Asn Asp Asp Tyr His Arg 
1060 1065 1070 

He Met Thr Gin Ala Leu Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu 
1075 1080 1085 

Met Leu His Glu Lys Val Arg Arg Glu Leu Trp Gly Tyr Ala Pro Gly 
1090 1095 1100 

Glu He Leu Gly Asn Glu Glu Leu lie Ala Glu Lys Tyr Arg Gly lie 
H05 1110 ins * 1120 

Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Asp His Thr Glu Lys Ala 
1125 1130 1135 

He He Phe Asp Leu Leu Asn Ala Glu Ala Ala Thr Gly Val Thr Leu 
1140 H45 1150 

Thr Glu Thr Phe Ala Met Asn Pro Ala Ala Ser Val Cys Gly Leu Tyr 
1155 1160 1165 

Phe Ala Asn Pro Ala Ser Lys Tyr Phe Val Leu Gly Lys He Gly Lys 
1170 1175 1180 

Asp Gin Val Glu Asp Tyr Ala Asn Arg Lys Gly Leu Glu Val Ala Glu 
1185 1190 H95 1200 

Ala Glu Lys Trp Leu Ala Pro Ser Leu Asn Tyr Asp Pro Ala 
1205 1210 



<210> 53 
<211> 52 
<212> DNA 

<213> Kunstliche Sequenz 
<220> 

<223> Beschreibung der kunstlichen Sequenz :PCR primer 



<400> 53 

cccgggatcc gctagcggcg cgccggccgg cccggtgtga aataccgcac ag 52 
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<210> 54 

<2U> 53 

<212> DNA 

<213> Kunstliche Sequenz 
<220> 

<223> Beschreibung der kunBtlichen Sequenz : PGR primer 

<400> 54 

tctagactcg agcggccgcg gccggccttt aaattgaaga cgaaagggcc teg 53 

<210> 55 
<2U> 47 
<212> DNA 

<213> Kunstliche Sequenz 
<220> 

<223> Beschreibung der kunstlichen Sequenz :PCR primer 
<400> 55 

gagatctaga cccggggatc cgctagcggg ctgetaaagg aagcgga 47 

<210> 56 
<211> 38 
<212> DNA 

<213> Kunstliche Sequenz 

<220> 

<223> Beschreibung der kunstlichen Sequenz :PCR primer 
<400> 56 

gagaggegeg ccgctagcgt gggcgaagaa ctccagca 38 

<210> 57 
<2U> 34 
<212> DNA 

<213> Kunstliche Sequenz 
<220> 

<223> Beschreibung der kunstlichen Sequenz :PCR primer 
<400> 57 

gagagggegg ccgcgcaaag tcccgcttcg tgaa 34 

<210> 58 
<211> 34 
<212> DNA 

<213> Kunstliche Sequenz 
<220> 

<223> Beschreibung der kunstlichen Sequenz :PCR primer 



<400> 58 

gagagggegg ccgctcaagt cggtcaagcc aege 



34 
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<210> 59 
<211> 140 
<212> DNA 

<213> KtaBtliche Sequen2 
<220> 

<223> Beschreibung der kunstlichen Sequenz :PCR primer 
<400> 59 

tcgaatttaa atctcgagag gcctgacgtc gggcccggta ccacgcgtca tatgactagt 60 
tcggacctag ggatatcgtc gacatcgatg ctcttctgcg ttaattaaca attgggatcc 120 
tctagacccg ggatttaaat 140 



<210> 60 
<211> 140 
<212> DNA 

<213> KtaBtliche Sequenz 
<220> 

<223> Beschreibung der kunstlichen Sequenz: PGR primer 
<400> 60 

gatcatttaa atcccgggtc tagaggatcc caattgttaa ttaacgcaga agagcatcga 60 
tgtcgacgat atccctaggt ccgaactagt catatgacgc gtggtaccgg gcccgacgtc 120 
aggcctctcg agatttaaat 140 



<210> 61 
<211> 33 
<212> DNA 

<213> KtaBtliche Sequenz 
<220> 

<223> Beschreibung der kunstlichen Sequenz :PCR primer 
<400> 61 

gagagcggcc gccgatcctt tttaacccat cac 33 



<210> 62 
<211> 32 
<212> DNA 

<213> Ktastliche Sequenz 
<220> 

<223> Beschreibung der kunstlichen Sequenz:PCR primer 
<400> 62 

aggagcggcc gccatcggca ttttcttttg eg 32 



<210> 63 
<211> 5091 
<212> DNA 

<213> KtaBtliche Sequenz 
<220> 

<223> Beschreibung der kOnstlichen Sequenz :Plasmid 



<400> 63 

gccgcgactg ccttcgcgaa gccttgcccc gcggaaattt cctccaccga gttcgtgcac 60 
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acccctatgc caagcttctt tcaccctaaa ttcgagagat tggattctta ccgtggaaat 120 
tcttcgcaaa aatcgtcccc tgatcgccct tgcgacgttg gcgtcggtgc cgctggttgc 180 
gcttggcttg accgacttga tcagcggccg ctcgatttaa atctcgagag gcctgacgtc 240 
gggcccggta ccacgcgtca tatgactagt tcggacctag ggatatcgtc gacatcgatg 300 
ctcttctgcg ttaattaaca attgggatcc tctagacccg ggatttaaat cgctagcggg 360 
ctgctaaagg aagcggaaca cgtagaaagc cagtccgcag aaacggtgct gaccccggat 420 
gaatgtcagc tactgggcta tctggacaag ggaaaacgca agcgcaaaga gaaagcaggt 480 
agcttgcagt gggcttacat ggcgatagct agactgggcg gttttatgga cagcaagcga 540 
accggaattg ccagctgggg cgccctctgg taaggttggg aagccctgca aagtaaactg 600 
gatggctttc ttgccgccaa ggatctgatg gcgcagggga tcaagatctg' atcaagagac 660 
aggatgagga tcgtttcgca tgattgaaca agatggattg cacgcaggtt ctccggccgc 720 
ttgggtggag aggctattcg gctatgactg ggcacaacag acaatcggct gctctgatgc 780 
cgccgtgttc cggctgtcag cgcaggggcg cccggttctt tttgtcaaga ccgacctgtc 840 
cggtgccctg aatgaactgc aggacgaggc agcgcggcta tcgtggctgg ccacgacggg 900 
cgttccttgc gcagctgtgc tcgacgttgt cactgaagcg ggaagggact ggctgctatt 960 
gggcgaagtg ccggggcagg atctcctgtc atctcacctt gctcctgccg agaaagtatc 1020 
catcatggct gatgcaatgc ggcggctgca tacgcttgat ccggctacct gcccattcga 1080 
ccaccaagcg aaacatcgca tcgagcgagc acgtactcgg atggaagccg gtcttgtcga 1140 
tcaggatgat ctggacgaag agcatcaggg gctcgcgcca gccgaactgt tcgccaggct 1200 
caaggcgcgc atgcccgacg gcgaggatct cgtcgtgacc catggcgatg cctgcttgcc 1260 
gaatatcatg gtggaaaatg gccgcttttc tggattcatc gactgtggcc ggctgggtgt 1320 
ggcggaccgc tatcaggaca tagcgttggc tacccgtgat attgctgaag agcttggcgg 1380 
cgaatgggct gaccgcttcc tcgtgcttta cggtatcgcc gctcccgatt cgcagcgcat 1440 
cgccttctat cgccttcttg acgagttctt ctgagcggga ctctggggtt cgaaatgacc 1500 
gaccaagcga cgcccaacct gccatcacga gatttcgatt ccaccgccgc cttctatgaa 1560 
aggttgggct tcggaatcgt tttccgggac gccggctgga tgatcctcca gcgcggggat 1620 
ctcatgctgg agttcttcgc ccacgctagc ggcgcgccgg ccggcccggt gtgaaatacc 1680 
gcacagatgc gtaaggagaa aataccgcat caggcgctct tccgcttcct cgctcactga 1740 
ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca gctcactcaa aggcggtaat 1800 
acggttatcc acagaatcag gggataacgc aggaaagaac atgtgagcaa aaggccagca 1860 
aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc tccgcccccc 1920 
tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga caggactata 1980 
aagataccag gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc 2040 
gcttaccgga tacctgtccg cctttctccc ttcgggaagc gtggcgcttt ctcatagctc 2100 
acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga 2160 
accccccgtt cagcccgacc gctgcgcctt atccggtaac tatcgtcttg agtccaaccc 2220 
ggtaagacac gacttatcgc cactggcagc agccactggt aacaggatta gcagagcgag 2280 
gtatgtaggc ggtgctacag agttcttgaa gtggtggcct aactacggct acactagaag 2340 
gacagtattt ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa gagttggtag 2400 
ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt gcaagcagca 2460 
gattacgcgc agaaaaaaag gatctcaaga agatcctttg atcttttcta cggggtctga 2520 
cgctcagtgg aacgaaaact cacgttaagg gattttggtc atgagattat caaaaaggat 2580 
cttcacctag atccttttaa aggccggccg cggccgcgca aagtcccgct tcgtgaaaat 2640 
tttcgtgccg cgtgattttc cgccaaaaac tttaacgaac gttcgttata atggtgtcat 2700 
gaccttcacg acgaagtact aaaattggcc cgaatcatca gctatggatc tctctgatgt 2760 
cgcgctggag tccgacgcgc tcgatgctgc cgtcgattta aaaacggtga tcggattttt 2820 
ccgagctctc gatacgacgg acgcgccagc atcacgagac tgggccagtg ccgcgagcga 2880 
cctagaaact ctcgtggcgg atcttgagga gctggctgac gagctgcgtg ctcggccagc 2940 
gccaggagga cgcacagtag tggaggatgc aatcagttgc gcctactgcg gtggcctgat 3000 
tcctccccgg cctgacccgc gaggacggcg cgcaaaatat tgctcagatg cgtgtcgtgc 3060 
cgcagccagc cgcgagcgcg ccaacaaacg ccacgccgag gagctggagg cggctaggtc 3120 
gcaaatggcg ctggaagtgc gtcccccgag cgaaattttg gccatggtcg tcacagagct 3180 
ggaagcggca gcgagaatta tcgcgatcgt ggcggtgccc gcaggcatga caaacatcgt 3240 
aaatgccgcg tttcgtgtgc cgtggccgcc caggacgtgt cagcgccgcc accacctgca 3300 
ccgaatcggc agcagcgtcg cgcgtcgaaa aagcgcacag gcggcaagaa gcgataagct 3360 
gcacgaatac ctgaaaaatg ttgaacgccc cgtgagcggt aactcacagg gcgtcggcta 3420 
acccccagtc caaacctggg agaaagcgct caaaaatgac tctagcggat tcacgagaca 3480 
ttgacacacc ggcctggaaa ttttccgctg atctgttcga cacccatccc gagctcgcgc 3540 
tgcgatcacg tggctggacg agcgaagacc gccgcgaatt cctcgctcac ctgggcagag 3600 
aaaatttcca gggcagcaag acccgcgact tcgccagcgc ttggatcaaa gacccggaca 3660 
cggagaaaca cagccgaagt tataccgagt tggttcaaaa tcgcttgccc ggtgccagta 3720 
tgttgctctg acgcacgcgc agcacgcagc cgtgcttgtc ctggacattg atgtgccgag 37B0 
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ccaccaggcc ggcgggaaaa tcgagcacgt aaaccccgag gtctacgcga ttttggagcg 3840 
ctgggcacgc ctggaaaaag cgccagcttg gatcggcgtg aatccactga gcgggaaatg 3900 
ccagctcatc tggctcattg atccggtgta tgccgcagca ggcatgagca gcccgaatat 3960 
gcgcctgctg gctgcaacga ccgaggaaat gacccgcgtt ttcggcgctg accaggcttt 4020 
ttcacatagg ctgagccgtg gccactgcac tctccgacga tcccagccgt accgctggca 4080 
tgcccagcac aatcgcgtgg atcgcctagc tgatcttatg gaggttgctc gcatgatctc 4140 
aggcacagaa aaacctaaaa aacgctatga gcaggagttt tctagcggac gggcacgtat 4200 
cgaagcggca agaaaagcca ctgcggaagc aaaagcactt gccacgcttg aagcaagcct 4260 
gccgagcgcc gctgaagcgt ctggagagct gatcgacggc gtccgtgtcc tctggactgc 4320 
tccagggcgt gccgcccgtg atgagacggc ttttcgccac gctttgactg tgggatacca 4380 
gttaaaagcg gctggtgagc gcctaaaaga caccaagggt catcgagcct acgagcgtgc 4440 
ctacaccgtc gctcaggcgg tcggaggagg ccgtgagcct gatctgccgc cggactgtga 4500 
ccgccagacg gattggccgc gacgtgtgcg cggctacgtc gctaaaggcc agccagtcgt 4560 
ccctgctcgt cagacagaga cgcagagcca gccgaggcga aaagctctgg ccactatggg 4620 
aagacgtggc ggtaaaaagg ccgcagaacg ctggaaagac ccaaacagtg agtacgcccg 4680 
agcacagcga gaaaaactag ctaagtccag tcaacgacaa gctaggaaag ctaaaggaaa 4740 
tcgcttgacc attgcaggtt ggtttatgac tgttgaggga gagactggct cgtggccgac 4800 
aatcaatgaa gctatgtctg aatttagcgt gtcacgtcag accgtgaata gagcacttaa 4860 
ggtctgcggg cattgaactt ccacgaggac gccgaaagct tcccagtaaa tgtgccatct 4920 
cgtaggcaga aaacggttcc cccgtagggt ctctctcttg gcctcctttc taggtcgggc 4980 
tgattgctct tgaagctctc taggggggct cacaccatag gcagataacg ttccccaccg 5040 
gctcgcctcg taagcgcaca aggactgctc ccaaagatct tcaaagccac t 5091 



<210> 64 
<211> 4323 
<212> DNA 

<213> K\bi8tliche Sequenz 
<220> 

<223> Beechreibung der kunstlichen Sequenz : Plasmid 
<400> 64 

tctctcagcg tatggttgtc gcctgagctg tagttgcctt catcgatgaa ctgctgtaca 60 
ttttgatacg tttttccgtc accgtcaaag attgatttat aatcctctac accgttgatg 120 
ttcaaagagc tgtctgatgc tgatacgtta acttgtgcag ttgtcagtgt ttgtttgccg 180 
taatgtttac cggagaaatc agtgtagaat aaacggattt ttccgtcaga tgtaaatgtg 240 
gctgaacctg accattcttg tgtttggtct tttaggatag aatcatttgc atcgaatttg 300 
tcgctgtctt taaagacgcg gccagcgttt ttccagctgt caatagaagt ttcgccgact 360 
ttttgataga acatgtaaat cgatgtgtca tccgcatttt taggatctcc ggctaatgca 420 
aagacgatgt ggtagccgtg atagtttgcg acagtgccgt cagcgttttg taatggccag 480 
ctgtcccaaa cgtccaggcc ttttgcagaa gagatatttt taattgtgga cgaatcaaat 540 
tcagaaactt gatatttttc atttttttgc tgttcaggga tttgcagcat atcatggcgt 600 
gtaatatggg aaatgccgta tgtttcctta tatggctttt ggttcgtttc tttcgcaaac 660 
gcttgagttg cgcctcctgc cagcagtgcg gtagtaaagg ttaatactgt tgcttgtttt 720 
gcaaactttt tgatgttcat cgttcatgtc tcctttttta tgtactgtgt tagcggtctg 780 
cttcttccag ccctcctgtt tgaagatggc aagttagtta cgcacaataa aaaaagacct 840 
aaaatatgta aggggtgacg ccaaagtata cactttgccc tttacacatt ttaggtcttg 900 
cctgctttat cagtaacaaa cccgcgcgat ttacttttcg acctcattct attagactct 960 
cgtttggatt gcaactggtc tattttcctc ttttgtttga tagaaaatca taaaaggatt 1020 
tgcagactac gggcctaaag aactaaaaaa tctatctgtt tcttttcatt ctctgtattt 1080 
tttatagttt ctgttgcatg ggcataaagt tgccttttta atcacaattc agaaaatatc 1140 
ataatatctc atttcactaa ataatagtga acggcaggta tatgtgatgg gttaaaaagg 1200 
atcggcggcc gctcgattta aatctcgaga ggcctgacgt cgggcccggt accacgcgtc 1260 
atatgactag ttcggaccta gggatatcgt cgacatcgat gctcttctgc gttaattaac 1320 
aattgggatc ctctagaccc gggatttaaa tcgctagcgg gctgctaaag gaagcggaac 1380 
acgtagaaag ccagtccgca gaaacggtgc tgaccccgga tgaatgtcag ctactgggct 1440 
atctggacaa gggaaaacgc aagcgcaaag agaaagcagg tagcttgcag tgggcttaca 1500 
tggcgatagc tagactgggc ggttttatgg acagcaagcg aaccggaatt gccagctggg 1560 
gcgccctctg gtaaggttgg gaagccctgc aaagtaaact ggatggcttt cttgccgcca 1620 
aggatctgat ggcgcagggg atcaagatct gatcaagaga caggatgagg atcgtttcgc 1680 
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atgattgaac aagatggatt gcacgcaggt 
ggctatgact gggcacaaca gacaatcggc 
gcgcaggggc gcccggttct ttttgtcaag 
caggacgagg cagcgcggct atcgtggctg 
ctcgacgttg tcactgaagc gggaagggac 
gatctcctgt catctcacct tgctcctgcc 
cggcggctgc atacgcttga tccggctacc 
atcgagcgag cacgtactcg gatggaagcc 
gagcatcagg ggctcgcgcc agccgaactg 
ggcgaggatc tcgtcgtgac ccatggcgat 
ggccgctttt ctggattcat cgactgtggc 
atagcgttgg ctacccgtga tattgctgaa 
ctcgtgcttt acggtatcgc cgctcccgat 
gacgagttct tctgagcggg actctggggt 
tgccatcacg agatttcgat tccaccgccg 
ttttccggga cgccggctgg atgatcctcc 
cccacgctag cggcgcgccg gccggcccgg 
aaataccgca tcaggcgctc ttccgcttcc 
cggctgcggc gagcggtatc agctcactca 
ggggataacg caggaaagaa catgtgagca 
aaggccgcgt tgctggcgtt tttccatagg 
cgacgctcaa gtcagaggtg gcgaaacccg 
cctggaagct ccctcgtgcg ctctcctgtt 
gcctttctcc cttcgggaag cgtggcgctt 
tcggtgtagg tcgttcgctc caagctgggc 
cgctgcgcct tatccggtaa ctatcgtctt 
ccactggcag cagccactgg taacaggatt 
gagttcttga agtggtggcc taactacggc 
gctctgctga agccagttac cttcggaaaa 
accaccgctg gtagcggtgg tttttttgtt 
ggatctcaag aagatccttt gatcttttct 
tcacgttaag ggattttggt catgagatta 
aaggccggcc gcggccgcca tcggcatttt 
tgtccttgtt caaggatgct gtctttgaca 
aggaagctcg gcgcaaacgt tgattgtttg 
cttgtaatca cgacattgtt tcctttcgct 
gttacatcgt taggatcaag atccattttt 
tatgggccag ttaaagaatt agaaacataa 
ccgtcaatcg tcatttttga tccgcgggag 
ttaaagacgt tcgcgcgttc aatttcatct 
atcacttttt tcagtgtgta atcatcgttt 
aactcagccg tgcgtttttt atcgctttgc 
gatgtgcttt tgccatagta tgctttgtta 
tcagttccag tgtttgcttc aaatactaag 
gga 
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tctccggccg cttgggtgga gaggctattc 1740 
tgctctgatg ccgccgtgtt ccggctgtca 1800 
accgacctgt ccggtgccct gaatgaactg 1860 
gccacgacgg gcgttccttg cgcagctgtg 1920 
tggctgctat tgggcgaagt gccggggcag 1980 
gagaaagtat ccatcatggc tgatgcaatg 2040 
tgcccattcg accaccaagc gaaacatcgc 2100 
ggtcttgtcg atcaggatga tctggacgaa 2160 
ttcgccaggc tcaaggcgcg catgcccgac 2220 
gcctgcttgc cgaatatcat. ggtggaaaat 2280 
cggctgggtg tggcggaccg ctatcaggac 2340 
gagcttggcg gcgaatgggc tgaccgcttc 2400 
tcgcagcgca tcgccttcta tcgccttctt 2460 
tcgaaatgac cgaccaagcg acgcccaacc 2520 
ccttctatga aaggttgggc ttcggaatcg 2580 
agcgcgggga tctcatgctg gagttcttcg 2640 
tgtgaaatac cgcacagatg cgtaaggaga 2700 
tcgctcactg actcgctgcg ctcggtcgtt 2760 
aaggcggtaa tacggttatc cacagaatca 2620 
aaaggccagc aaaaggccag gaaccgtaaa 2880 
ctccgccccc ctgacgagca tcacaaaaat 2940 
acaggactat aaagatacca ggcgtttccc 3000 
ccgaccctgc cgcttaccgg atacctgtcc 3060 
tctcatagct cacgctgtag gtatctcagt 3120 
tgtgtgcacg aaccccccgt tcagcccgac 3180 
gagtccaacc cggtaagaca cgacttatcg 3240 
agcagagcga ggtatgtagg cggtgctaca 3300 
tacactagaa ggacagtatt tggtatctgc 3360 
agagttggta gctcttgatc cggcaaacaa 3420 
tgcaagcagc agattacgcg cagaaaaaaa 3480 
acggggtctg acgctcagtg gaacgaaaac 3540 
tcaaaaagga tcttcaccta gatcctttta 3600 
cttttgcgtt tttatttgtt aactgttaat 3660 
acagatgttt tcttgccttt gatgttcagc 3720 
tctgcgtaga atcctctgtt tgtcatatag 3780 
tgaggtacag cgaagtgtga gtaagtaaag 3840 
aacacaaggc cagttttgtt cagcggcttg 3900 
ccaagcatgt aaatatcgtt agacgtaatg 3960 
tcagtgaaca ggtaccattt gccgttcatt 4020 
gttactgtgt tagatgcaat cagcggtttc 4080 
agctcaatca taccgagagc gccgtttgct 414 0 
agaagttttt gactttcttg acggaagaat 4200 
aataaagatt cttcgccttg gtagccatct 4260 
tatttgtggc ctttatcttc tacgtagtga 4320 

4323 



<210> 65 

<211> 35 

<212> DNA 

<213> PCR Primer 

<400> 65 

gagagagaga cgcgtcccag tggctgagac gcatc 35 



<210> 66 

<2U> 34 

<212> DNA 

<213> PCR Primer 

<400> 66 

ctctctctgt cgacgaattc aatcttacgg cctg 34 
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<210> 67 

<211> 38 

<212> DNA 

<213> PCR Primer 

<400> 67 

cggcaccacc gacatcatct tcacctgccc tcgttccg 

<210> 68 

<2U> 38 

<212> DNA 

<213> PCR Primer 

<400> 68 

cggaacgagg gcaggtgaag atgatgtcgg tggtgccg 



<210> 69 

<211> 31 

<212> DNA 

<213> PCR Primer 

<400> 69 

gagactcgag ggaaggtgaa tcgaatttcg g 

<210> 70 

<211> 38 

<212> DNA 

<213> PCR Primer 

<400> 70 

gtcccgggga gaacgcacga ttctccaaaa ataatcgc 



<210> 71 

<211> 23 

<212> DNA 

<213> PCR Primer 

<400> 71 

gaatcgtgcg ttctccccgg gac 



<210> 72 

<211> 22 

<212> DNA 

<213> PCR Primer 

<400> 72 

gtagttgacc gagttgatca cc 



<210> 73 

<211> 18 

<212> DNA 

<213> PCR Primer 

<400> 73 

ccggcctgga gaagctcg 



<210> 74 

<211> 28 

<212> DNA 

<213> PCR Primer 

<400> 74 
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gagagatatc cctcagcggg cgttgaag 28 



<210> 


75 


<211> 


1266 


<212> 


DNA 


<213> 


Lysc Mutante 


<220> 




<221> 


CDS 


<222> 


(1) (1266) 


<223> 




<400> 


75 



gtg gcc ctg gtc gta cag aaa tat ggc ggt tec teg ctt gag agt gcg 48 
Val Ala Leu Val Val Gin Lys Tyr Gly Gly Ser Ser Leu Glu Ser Ala 
1 5 10 15 

gaa cgc att aga aac gtc get gaa egg ate gtt gcc acc aag aag get 96 
Glu Arg lie Arg Asn Val Ala Glu Arg lie Val Ala Thr Lys Lys Ala 
20 25 30 

gga aat gat gtc gtg gtt gtc tgc tec gca atg gga gac acc acg gat 144 
Gly Asn Asp Val Val Val Val Cys Ser Ala Met Gly Asp Thr Thr Asp 
35 40 45 

gaa ctt eta gaa ctt gca gcg gca gtg aat ccc gtt ccg cca get cgt 192 
Glu Leu Leu Glu Leu Ala Ala Ala Val Asn Pro Val Pro Pro Ala Arg 
50 55 60 

gaa atg gat atg etc ctg act get ggt gag cgt att tct aac get etc 240 
Glu Met Asp Met Leu Leu Thr Ala Gly Glu Arg lie Ser Asn Ala Leu 
65 70 75 80 

gtc gcc atg get att gag tec ctt ggc gca gaa gcc caa tct ttc acg 288 
Val Ala Met Ala He Glu Ser Leu Gly Ala Glu Ala Gin Ser Phe Thr 
85 so 95 

ggc tct cag get ggt gtg etc acc acc gag cgc cac gga aac gca cgc 336 
Gly Ser Gin Ala Gly Val Leu Thr Thr Glu Arg Hia Gly Asn Ala Arg 
100 105 no 

att gtt gat gtc act cca ggt cgt gtg cgt gaa gca etc gat gag ggc 384 
He val Asp Val Thr Pro Gly Arg Val Arg Glu Ala Leu Asp Glu Gly 
H5 120 125 

aag ate tgc att gtt get ggt ttc cag ggt gtt aat aaa gaa acc cgc 432 
Lys He Cys He Val Ala Gly Phe Gin Gly val Asn Lys Glu Thr Arg 
130 135 140 

gat gtc acc acg ttg ggt cgt ggt ggt tct gac acc act gca gtt gcg 480 
Asp Val Thr Thr Leu Gly Arg Gly Gly Ser Asp Thr Thr Ala Val Ala 
145 150 155 160 

ttg gca get get ttg aac get gat gtg tgt gag att tac teg gac gtt 528 
Leu Ala Ala Ala Leu Asn Ala Asp Val Cys Glu He Tyr Ser Asp Val 
165 170 * 175 

gac ggt gtg tat acc get gac ccg cgc ate gtt cct aat gca cag aag 576 
Asp Gly Val Tyr Thr Ala Asp Pro Arg He Val Pro Asn Ala Gin Lys 
1B0 185 190 

ctg gaa aag etc age ttc gaa gaa atg ctg gaa ctt get get gtt ggc 624 
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Leu Glu Lys Leu Ser Phe Glu Glu Met Leu Glu Leu Ala Ala Val Gly 
195 200 205 



tec aag att ttg gtg ctg cgc agt gtt gaa tac get cgt gca ttc aat 
Ser Lys He Leu Val Leu Arg Ser Val Glu Tyr Ala Arg Ala Phe Asn 
210 215 220 



672 



gtg cca ctt cgc gta cgc teg tct tat agt aat gat ccc ggc act ttg 720 
Val Pro Leu Arg Val Arg Ser Ser Tyr Ser Asn Asp Pro Gly Thr Leu 
225 230 235 240 

att gec ggc tct atg gag gat att cct gtg gaa gaa gca gtc ctt acc 768 
He Ala Gly Ser Met Glu Asp He Pro Val Glu Glu Ala Val Leu Thr 
245 250 255 

ggt gtc gca acc gac aag tec gaa gee aaa gta acc gtt ctg ggt att 816 
Gly Val Ala Thr Asp Lys Ser Glu Ala Lys Val Thr Val Leu Gly He 
260 265 270 

tec gat aag cca ggc gag get gcg aag gtt ttc cgt gcg ttg get gat 864 
Ser Asp Lys Pro Gly Glu Ala Ala Lys Val Phe Arg Ala Leu Ala Asp 
275 280 285 

gca gaa ate aac att gac atg gtt ctg cag aac gtc tct tct gta gaa 912 
Ala Glu He Asn He Asp Met Val Leu Gin Asn Val Ser Ser Val Glu 
290 295 300 

gac ggc acc acc gac ate ate ttc acc tgc cct cgt tec gac ggc cgc 960 
Asp Gly Thr Thr Asp He He Phe Thr Cys Pro Arg Ser Asp Gly Arg 
305 310 315 320 

cgc gcg atg gag ate ttg aag aag ctt cag gtt cag ggc aac tgg acc 1008 
Arg Ala Met Glu He Leu Lys Lys Leu Gin Val Gin Gly Asn Trp Thr 
325 330 335 

aat gtg ctt tac gac gac cag gtc ggc aaa gtc tec etc gtg ggt get 1056 
Asn Val Leu Tyr Asp Asp Gin Val Gly Lye Val Ser Leu Val Gly Ala 
340 345 350 

ggc atg aag tct cac cca ggt gtt acc gca gag ttc atg gaa get ctg 1104 
Gly Met Lys Ser His Pro Gly Val Thr Ala Glu Phe Met Glu Ala Leu 
355 360 365 

cgc gat gtc aac gtg aac ate gaa ttg att tec acc tct gag att cgt 1152 
Arg Asp Val Asn Val Asn He Glu Leu He Ser Thr Ser Glu lie Arg 
370 375 380 

att tec gtg ctg ate cgt gaa gat gat ctg gat get get gca cgt gca 1200 
He Ser Val Leu He Arg Glu Asp Asp Leu Asp Ala Ala Ala Arg Ala 
3 85 390 395 400 

ttg cat gag cag ttc cag ctg ggc ggc gaa gac gaa gee gtc gtt tat 1248 
Leu His Glu Gin Phe Gin Leu Gly Gly Glu Asp Glu Ala Val Val Tyr 
4 05 410 415 

gca ggc acc gga cgc taa 12 6g 
Ala Gly Thr Gly Arg 
420 



<210> 76 
<211> 421 



WO 03/087386 PCT/EP03/04010 

241 

<212> PRT 

<213> LysC Mutante 

<400> 76 

Val Ala Leu Val Val Gin Lys Tyr Gly Gly Ser Ser Leu Glu Ser Ala 
1 5 10 15 

Glu Arg He Arg Asn Val Ala Glu Arg He Val Ala Thr Lye Lys Ala 
20 25 30, 

Gly Asn Asp Val Val Val Val Cys Ser Ala Met Gly Asp Thr Thr Asp 
35 40 45 

Glu Leu Leu Glu Leu Ala Ala Ala Val Asn Pro Val Pro Pro Ala Arg 
50 55 60 

Glu Met Asp Met Leu Leu Thr Ala Gly Glu Arg He Ser Asn Ala Leu 
65 70 75 80 

Val Ala Met Ala He Glu Ser Leu Gly Ala Glu Ala Gin Ser Phe Thr 
85 90 95 

Gly Ser Gin Ala Gly Val Leu Thr Thr Glu Arg His Gly Asn Ala Arg 
100 105 no 

He Val Asp Val Thr Pro Gly Arg Val Arg Glu Ala Leu Asp Glu Gly 
115 120 125 

Lys He Cys He Val Ala Gly Phe Gin Gly Val Asn Lya Glu Thr Arg 
130 135 140 

Asp Val Thr Thr Leu Gly Arg Gly Gly Ser Asp Thr Thr Ala Val Ala 
145 150 155 160 

Leu Ala Ala Ala Leu Asn Ala Asp Val Cys Glu lie Tyr Ser Asp Val 
165 170 175 

Asp Gly Val Tyr Thr Ala Asp Pro Arg He Val Pro Asn Ala Gin Lys 
180 185 190 

Leu Glu Lys Leu Ser Phe Glu Glu Met Leu Glu Leu Ala Ala Val Gly 
195 200 205 

Ser Lys He Leu Val Leu Arg Ser Val Glu Tyr Ala Arg Ala Phe Asn 
210 215 220 



Val Pro Leu Arg Val Arg Ser Ser Tyr Ser Asn Asp Pro Gly Thr Leu 
225 230 235 240 
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He Ala Gly Ser Met Glu Asp He Pro Val Glu Glu Ala Val Leu Thr 
245 250 255 



Gly Val Ala Thr Asp LyB Ser Glu Ala Lys Val Thr Val Leu Gly He 
260 265 270 



Ser Asp Lys Pro Gly Glu Ala Ala Lys Val Phe Arg Ala Leu Ala Asp 
275 280 285 



Ala Glu He Asn He Asp Met Val Leu Gin. Asn Val Ser Ser Val Glu 
290 295 300 



Asp Gly Thr Thr Asp He He Phe Thr Cys Pro Arg Ser Asp Gly Arg 
305 310 315 320 



Arg Ala Met Glu He Leu Lys Lys Leu Gin Val Gin Gly Asn Trp Thr 
325 330 335 



Asn Val Leu Tyr Asp Asp Gin Val Gly Lys Val Ser Leu Val Gly Ala 
340 345 350 



Gly Met Lys Ser His Pro Gly Val Thr Ala Glu Phe Met Glu Ala Leu 
355 360 365 



Arg Asp Val Asn Val Asn He Glu Leu He Ser Thr Ser Glu He Arg 
370 375 380 



He Ser Val Leu He Arg Glu Asp Asp Leu Asp Ala Ala Ala Arg Ala 
385 390 395 400 



Leu His Glu Gin Phe Gin Leu Gly Gly Glu Asp Glu Ala Val Val Tyr 
405 410 415 



Ala Gly Thr Gly Arg 
420 



<210> 77 
<211> 5860 
<212> DNA 
<213> Plasmid 
<400> 77 

cccggtacca cgcgtcccag tggctgagac gcatccgcta aagccccagg aaccctgtgc 60 
agaaagaaaa cactcctctg gctaggtaga cacagtttat aaaggtagag ttgagcgggt 120 
aactgtcagc acgtagatcg aaaggtgcac aaaggtggcc ctggtcgtac agaaatatgg 180 
cggttcctcg cttgagagtg cggaacgcat tagaaacgtc gctgaacgga tcgttgccac 240 
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caagaaggct ggaaatgatg tcgtggttgt 
act^ctagaa cttgcagcgg cagtgaatcc 
cctgactgct ggtgagcgta tttctaacgc 
cgcagaagcc caatctttca cgggctctca 
aaacgcacgc attgttgatg tcactccagg 
gatctgcatt gttgctggtt tccagggtgt 
gggtcgtggt ggttctgaca ccactgcagt 
gtgtgagatt tactcggacg ttgacggtgt 
tgcacagaag ctggaaaagc tcagcttcga 
caagattttg gtgctgcgca gtgttgaata 
acgctcgtct tatagtaatg atcccggcac 
tgtggaagaa gcagtcctta ccggtgtcgc 
tctgggtatt tccgataagc caggcgaggc 
agaaatcaac attgacatgg ttctgcagaa 
catcaccttc acctgccctc gttccgacgg 
tcaggttcag ggcaactgga ccaatgtgct 
cgtgggtgct ggcatgaagt ctcacccagg 
cgatgtcaac gtgaacatcg aattgatttc 
ccgtgaagat gatctggatg ctgctgcacg 
cgaagacgaa gccgtcgttt atgcaggcac 
acaatgacca ccatcgcagt tgttggtgca 
cttttggaag agcgcaattt cccagctgac 
gcaggccgta agattgaatt cgtcgacatc 
atcctctaga cccgggattt aaatcgctag 
aagccagtcc gcagaaacgg tgctgacccc 
caagggaaaa cgcaagcgca aagagaaagc 
agctagactg ggcggtttta tggacagcaa 
ctggtaaggt tgggaagccc tgcaaagtaa 
gatggcgcag gggatcaaga tctgatcaag 
aacaagatgg attgcacgca ggttctccgg 
actgggcaca acagacaatc ggctgctctg 



243 

ctgctccgca 
cgttccgcca 
tctcgtcgcc 
ggctggtgtg 
tcgtgtgcgt 
taataaagaa 
tgcgttggca 
gtataccgct 
agaaatgctg 
cgctcgtgca 
tttgattgcc 
aaccgacaag 
tgcgaaggtt 
cgtctcttct 
ccgccgcgcg 
ttacgacgac 
tgttaccgca 
cacctctgag 
tgcattgcat 
cggacgctaa 
accggccagg 
actgttcgtt 
gatgctcttc 
cgggctgcta 
ggatgaatgt 
a 99tagcttg 
gcgaaccgga 
actggatggc 
agacaggatg 
ccgcttgggt 
atgccgccgt 



atgggagaca 
gctcgtgaaa 
atggctattg 
ctcaccaccg 
gaagcactcg 
acccgcgatg 
gctgctttga 
gacccgcgca 
gaacttgctg 
ttcaatgtgc 
ggctctatgg 
tccgaagcca 
ttccgtgcgt 
gtagaagacg 
atggagatct 
caggtcggca 
gagttcatgg 
attcgtattt 
gagcagttcc 
agttttaaag 
tcggccaggt 
tctttgcttc 
tgcgttaatt 
aaggaagcgg 
cagctactgg 
cagtgggctt 
attgccagct 
tttcttgccg 
aggatcgttt 
ggagaggcta 
gttccggctg 



ccacggatga 
tggatatgct 
agtcccttgg 
agcgccacgg 
atgagggcaa 
tcaccacgtt 
acgctgatgt 
tcgttcctaa 
ctgttggctc 
cacttcgcgt 
aggatattcc 
aagtaaccgt 
tggctgatgc 
gcaccaccga 
tgaagaagct 
aagtctccct 
aagctctgcg 
ccgtgctgat 
agctgggcgg 
gagtagtttt 
tatgcgcacc 
cccacgttcc 
aacaattggg 
aacacgtaga 
gctatctgga 
acatggcgat 
ggggcgccct 
ccaaggatct 
cgcatgattg 
ttcggctatg 
tcagcgcagg 



300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
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ggcgcccggt tctttttgtc aagaccgacc tgtccggtgc cctgaatgaa ctgcaggacg 2160 

aggcagcgcg gctatcgtgg ctggccacga cgggcgttcc ttgcgcagct gtgctcgacg 2220 

ttgtcactga agcgggaagg gactggctgc tattgggcga agtgccgggg caggatctcc 2280 

tgtcatctca ccttgctcct gccgagaaag tatccatcat ggctgatgca atgcggcggc 2340 

tgcatacgct tgatccggct acctgcccat tcgaccacca agcgaaacat cgcatcgagc 2400 

gagcacgtac tcggatggaa gccggtcttg tcgatcagga tgatctggac gaagagcatc 2460 

aggggctcgc gccagccgaa ctgttcgcca ggctcaaggc gcgcatgccc gacggcgagg 2S20 

atctcgtcgt gacccatggc gatgcctgct tgccgaatat catggtggaa aatggccgct 2580 

tttctggatt catcgactgt ggccggctgg gtgtggcgga ccgctatcag gacatagcgt 2640 

tggctacccg tgatattgct gaagagcttg gcggcgaatg ggctgaccgc ttcctcgtgc 2700 

tttacggtat cgccgctccc gattcgcagc gcatcgcctt ctatcgcctt cttgacgagt 2760 

tcttctgagc gggactctgg ggttcgaaat gaccgaccaa gcgacgccca acctgccatc 2820 

acgagatttc gattccaccg ccgccttcta tgaaaggttg ggcttcggaa tcgttttccg 2880 

ggacgccggc tggatgatcc tccagcgcgg ggatctcatg ctggagttct tcgcccacgc 2940 

tagcggcgcg ccggccggcc cggtgtgaaa taccgcacag atgcgtaagg agaaaatacc 3000 

gcatcaggcg ctcttccgct tcctcgctca ctgactcgct gcgctcggtc gttcggctgc 3060 

ggcgagcggt atcagctcac tcaaaggcgg taatacggtt atccacagaa tcaggggata 3120 

acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg 3180 

cgttgctggc gtttttccat aggctccgcc cccctgacga gcatcacaaa aatcgacgct 3240 

caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgttt ccccctggaa 3300 

gctccctcgt gcgctctcct gttccgaccc tgccgcttac cggatacctg tccgcctttc 3360 

tcccttcggg aagcgtggcg ctttctcata gctcacgctg taggtatctc agttcggtgt 3420 

aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg 3480 

ccttatccgg taactatcgt cttgagtcca acccggtaag acacgactta tcgccactgg 3540 

cagcagccac tggtaacagg attagcagag cgaggtatgt aggcggtgct acagagttct 3600 

tgaagtggtg gcctaactac ggctacacta gaaggacagt atttggtatc tgcgctctgc 3660 

tgaagccagt taccttcgga aaaagagttg gtagctcttg atccggcaaa caaaccaccg 3720 

ctggtagcgg tggttttttt gtttgcaagc agcagattac gcgcagaaaa aaaggatctc 3780 

aagaagatcc tttgatcttt tctacggggt ctgacgctca gtggaacgaa aactcacgtt 3840 

aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt ttaaaggccg 3900 

gccgcggccg ccatcggcat tttcttttgc gtttttattt gttaactgtt aattgtcctt 3960 
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gttcaaggat gctgtctttg acaacagatg ttttcttgcc tttgatgttc agcaggaagc 4020 

tcggcgcaaa cgttgattgt ttgtctgcgt agaatcctct gtttgtcata £agcttgtaa 4080 

tcacgacatt gtttcctttc gcttgaggta cagcgaagtg tgagtaagta aaggttacat 4140 

cgttaggatc aagatccatt tttaacacaa ggccagtttt gttcagcggc ttgtatgggc 4200 

cagttaaaga attagaaaca taaccaagca tgtaaatatc gttagacgta, atgccgtcaa 4260 

tcgtcatttt tgatccgcgg gagtcagtga acaggtacca tttgccgttc attttaaaga 4320 

cgttcgcgcg ttcaatttca tctgttactg tgttagatgc aatcagcggt ttcatcactt 4380 

ttttcagtgt gtaatcatcg tttagctcaa tcataccgag agcgccgttt gctaactcag 4440 

ccgtgcgttt tttatcgctt tgcagaagtt tttgactttc ttgacggaag aatgatgtgc 4500 

ttttgccata gtatgctttg ttaaataaag attcttcgcc ttggtagcca tcttcagttc 4560 

cagtgtttgc ttcaaatact aagtatttgt ggcctttatc ttctacgtag tgaggatctc 4620 

tcagcgtatg gttgtcgcct gagctgtagt tgccttcatc gatgaactgc tgtacatttt 4680 

gatacgtttt tccgtcaccg tcaaagattg atttataatc ctctacaccg ttgatgttca 4740 

aagagctgtc tgatgctgat acgttaactt gtgcagttgt cagtgtttgt ttgccgtaat 4800 

gtttaccgga gaaatcagtg tagaataaac ggatttttcc gtcagatgta aatgtggctg 4860 

aacctgacca ttcttgtgtt tggtctttta ggatagaatc atttgcatcg aatttgtcgc 4920 

tgtctttaaa gacgcggcca gcgtttttcc agctgtcaat agaagtttcg ccgacttttt 4980 

gatagaacat gtaaatcgat gtgtcatccg catttttagg atctccggct aatgcaaaga 5040 

cgatgtggta gccgtgatag tttgcgacag tgccgtcagc gttttgtaat ggccagctgt 5100 

cccaaacgtc caggcctttt gcagaagaga tatttttaat tgtggacgaa tcaaattcag 5160 

aaacttgata tttttcattt ttttgctgtt cagggatttg cagcatatca tggcgtgtaa 5220 

tatgggaaat gccgtatgtt tccttatatg gcttttggtt cgtttctttc gcaaacgctt 5280 

gagttgcgcc tcctgccagc agtgcggtag taaaggttaa tactgttgct tgttttgcaa 5340 

actttttgat gttcatcgtt catgtctcct tttttatgta ctgtgttagc ggtctgcttc 5400 

ttccagccct cctgtttgaa gatggcaagt tagttacgca caataaaaaa agacctaaaa 5460 

tatgtaaggg gtgacgccaa agtatacact ttgcccttta cacattttag gtcttgcctg 5520 

ctttatcagt aacaaacccg cgcgatttac ttttcgacct cattctatta gactctcgtt 5580 

tggattgcaa ctggtctatt ttcctctttt gtttgataga aaatcataaa aggatttgca 5640 

gactacgggc ctaaagaact aaaaaatcta tctgtttctt ttcattctct gtatttttta 5700 

tagtttctgt tgcatgggca taaagttgcc tttttaatca caattcagaa aatatcataa 5760 

tatctcattt cactaaataa tagtgaacgg caggtatatg tgatgggtta aaaaggatcg 5820 
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gcggccgctc gatttaaatc tcgagaggcc tgacgtcggg 5860 

<210> 78 
<211> 5860 
<212> DNA 
<213> Plasmid 
<40O> 78 

cccggtacca cgcgtcccag tggctgagac gcatccgcta aagccccagg aaccctgtgc 60 

agaaagaaaa cactcctctg gctaggtaga cacagtttat aaaggtagag ttgagcgggt 120 

aactgtcagc acgtagatcg aaaggtgcac aaaggtggcc ctggtcgtac agaaatatgg 180 

cggttcctcg cttgagagtg cggaacgcat tagaaacgtc gctgaacgga tcgttgccac 240 

caagaaggct ggaaatgatg tcgtggttgt ctgctccgca atgggagaca ccacggatga 300 

acttctagaa cttgcagcgg cagtgaatcc cgttccgcca gctcgtgaaa tggatatgct 360 

cctgactgct ggtgagcgta tttctaacgc tctcgtcgcc atggctattg agtcccttgg 420 

cgcagaagcc caatctttca cgggctctca ggctggtgtg ctcaccaccg agcgccacgg 480 

aaacgcacgc attgttgatg tcactccagg tcgtgtgcgt gaagcactcg atgagggcaa 540 

gatctgcatt gttgctggtt tccagggtgt taataaagaa acccgcgatg tcaccacgtt 600 

gggtcgtggt ggttctgaca ccactgcagt tgcgttggca gctgctttga acgctgatgt 660 

gtgtgagatt tactcggacg ttgacggtgt gtataccgct gacccgcgca tcgttcctaa 720 

tgcacagaag ctggaaaagc tcagcttcga agaaatgctg gaacttgctg ctgttggctc 780 

caagattttg gtgctgcgca gtgttgaata cgctcgtgca ttcaatgtgc cacttcgcgt 840 

acgctcgtct tatagtaatg atcccggcac tttgattgcc ggctctatgg aggatattcc 900 

tgtggaagaa gcagtcctta ccggtgtcgc aaccgacaag tccgaagcca aagtaaccgt 960 

tctgggtatt tccgataagc caggcgaggc tgcgaaggtt ttccgtgcgt tggctgatgc 1020 

agaaatcaac attgacatgg ttctgcagaa cgtctcttct gtagaagacg gcaccaccga 1080 

catcatcttc acctgccctc gttccgacgg ccgccgcgcg atggagatct tgaagaagct 1140 

tcaggttcag ggcaactgga ccaatgtgct ttacgacgac caggtcggca aagtctccct 1200 

cgtgggtgct ggcatgaagt ctcacccagg tgttaccgca gagttcatgg aagctctgcg 1260 

cgatgtcaac gtgaacatcg aattgatttc cacctctgag attcgtattt ccgtgctgat 1320 

ccgtgaagat gatctggatg ctgctgcacg tgcattgcat gagcagttcc agctgggcgg 1380 

cgaagacgaa gccgtcgttt atgcaggcac cggacgctaa agttttaaag gagtagtttt 1440 

acaatgacca ccatcgcagt tgttggtgca accggccagg tcggccaggt tatgcgcacc 1500 

cttttggaag agcgcaattt cccagctgac actgttcgtt tctttgcttc cccacgttcc 1560 

gcaggccgta agattgaatt cgtcgacatc gatgctcttc tgcgttaatt aacaattggg 1620 
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atcctctaga cccgggattt aaatcgctag cgggctgcta aaggaagcgg aacacgtaga 1680 

aagccagtcc gcagaaacgg tgctgacccc ggatgaatgt cagctactgg uctatctgga 1740 

caagggaaaa cgcaagcgca aagagaaagc aggtagcttg cagtgggctt acatggcgat 1800 

agctagactg ggcggtttta tggacagcaa gcgaaccgga attgccagct ggggcgccct 1860 

ctggtaaggt tgggaagccc tgcaaagtaa actggatggc tttcttgccg ccaaggatct 1920 

gatggcgcag gggatcaaga tctgatcaag agacaggatg aggatcgttt cgcatgattg 1980 

aacaagatgg attgcacgca ggttctccgg ccgcttgggt ggagaggcta ttcggctatg 2040 

actgggcaca acagacaatc ggctgctctg atgccgccgt gttccggctg tcagcgcagg 2100 

ggcgcccggt tctttttgtc aagaccgacc tgtccggtgc cctgaatgaa ctgcaggacg 2160 

aggcagcgcg gctatcgtgg ctggccacga cgggcgttcc ttgcgcagct gtgctcgacg 2220 

ttgtcactga agcgggaagg gactggctgc tattgggcga agtgccgggg caggatctcc 2280 

tgtcatctca ccttgctcct gccgagaaag tatccatcat ggctgatgca atgcggcggc 2340 

tgcatacgct tgatccggct acctgcccat tcgaccacca agcgaaacat cgcatcgagc 24 00 

gagcacgtac tcggatggaa gccggtcttg tcgatcagga tgatctggac gaagagcatc 2460 

aggggctcgc gccagccgaa ctgttcgcca ggctcaaggc gcgcatgccc gacggcgagg 2520 

atctcgtcgt gacccatggc gatgcctgct tgccgaatat catggtggaa aatggccgct 2580 

tttctggatt catcgactgt ggccggctgg gtgtggcgga ccgctatcag gacatagcgt 2640 

tggctacceg tgatattgct gaagagcttg gcggcgaatg ggctgaccgc ttcctcgtgc 2700 

tttacggtat cgccgctccc gattcgcagc gcatcgcctt ctatcgcctt cttgacgagt 2760 

tcttctgagc gggactctgg ggttcgaaat gaccgaccaa gcgacgccca acctgccatc 2820 

acgagatttc gattccaccg ccgccttcta tgaaaggttg ggcttcggaa tcgttttccg 2880 

ggacgccggc tggatgatcc tccagcgcgg ggatctcatg ctggagttct tcgcccacgc 2940 

tagcggcgcg ccggccggcc cggtgtgaaa taccgcacag atgcgtaagg agaaaatacc 3000 

gcatcaggcg ctcttcegct tcctcgctca ctgactcgct gcgctcggtc gttcggctgc 3060 

ggcgagcggt atcagctcac tcaaaggcgg taatacggtt atccacagaa tcaggggata 3120 

acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg 3180 

cgttgctggc gtttttccat aggctccgcc cccctgacga gcatcacaaa aatcgacgct 3240 

caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgttt ccccctggaa 3300 

gctccctcgt gcgctctcct gttccgaccc tgccgcttac cggatacctg tccgcctttc 3360 

tcccttcggg aagcgtggcg ctttctcata gctcacgctg taggtatctc agttcggtgt 3420 

aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg 3480 
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ccttatccgg taactatcgt cttgagtcca acccggtaag acacgactta tcgccactgg 3540 

cagcagccac tggtaacagg attagcagag cgaggtatgt aggcggtgct acagagttct 3600 

tgaagtggtg gcctaactac ggctacacta gaaggacagt atttggtatc tgcgctctgc 3660 

tgaagccagt taccttcgga aaaagagttg gtagctcttg atccggcaaa caaaccaccg 3720 

ctggtagcgg tggttttttt gtttgcaagc agcagattac gcgcagaaaa aaaggatctc 3780 

aagaagatcc tttgatcttt tctacggggt ctgacgctca gtggaacgaa aactcacgtt 3840 

aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt ttaaaggccg 3900 

gccgcggccg ccatcggcat tttcttttgc gtttttattt gttaactgtt aattgtcctt 3960 

gttcaaggat gctgtctttg acaacagatg ttttcttgcc tttgatgttc agcaggaagc 4020 

tcggcgcaaa cgttgattgt ttgtctgcgt agaatcctct gtttgtcata tagcttgtaa 4080 

tcacgacatt gtttcctttc gcttgaggta cagcgaagtg tgagtaagta aaggttacat 4140 

cgttaggatc aagatccatt tttaacacaa ggccagtttt gttcagcggc ttgtatgggc 4200 

cagttaaaga attagaaaca taaccaagca tgtaaatatc gttagacgta atgccgtcaa 4260 

tcgtcatttt tgatccgcgg gagtcagtga acaggtacca tttgccgttc attttaaaga 4320 

cgttcgcgcg ttcaatttca tctgttactg tgttagatgc aatcagcggt ttcatcactt 4380 

ttttcagtgt gtaatcatcg tttagctcaa tcataccgag agcgccgttt gctaactcag 4440 

ccgtgcgttt tttatcgctt tgcagaagtt tttgactttc ttgacggaag aatgatgtgc 4500 

ttttgccata gtatgctttg ttaaataaag attcttcgcc ttggtagcca tcttcagttc 4560 

cagtgtttgc ttcaaatact aagtatttgt ggcctttatc ttctacgtag tgaggatctc 4620 

tcagcgtatg gttgtcgcct gagctgtagt tgccttcatc gatgaactgc tgtacatttt 4680 

gatacgtttt tccgtcaccg tcaaagattg atttataatc ctctacaccg ttgatgttca 4740 

aagagctgtc tgatgctgat acgttaactt gtgcagttgt cagtgtttgt ttgccgtaat 4 800 

gtttaccgga gaaatcagtg tagaataaac ggatttttcc gtcagatgta aatgtggctg 4860 

aacctgacca ttcttgtgtt tggtctttta ggatagaatc atttgcatcg aatttgtcgc 4920 

tgtctttaaa gacgcggcca gcgtttttcc agctgtcaat agaagtttcg ccgacttttt 4980 

gatagaacat gtaaatcgat gtgtcatccg catttttagg atctccggct aatgcaaaga 5040 

cgatgtggta gccgtgatag tttgcgacag tgccgtcagc gttttgtaat ggccagctgt 5100 

cccaaacgtc caggcctttt gcagaagaga tatttttaat tgtggacgaa tcaaattcag 5160 

aaacttgata tttttcattt ttttgctgtt cagggatttg cagcatatca tggcgtgtaa 5220 

tatgggaaat gccgtatgtt tccttatatg gcttttggtt cgtttctttc gcaaacgctt 5280 

gagttgcgcc tcctgccagc agtgcggtag taaaggttaa tactgttgct tgttttgcaa 5340 
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actttttgat gttcatcgtt catgtctcct tttttatgta ctgtgttagc ggtctgcttc 5400 

ttccagccct cctgtttgaa gatggcaagt tagttacgca caataaaaaa agacctaaaa 5460 

tatgtaaggg gtgacgccaa agtatacact ttgcccttta cacattttag gtcttgcctg 5520 

ctttatcagt aacaaacccg cgcgatttac ttttcgacct cattctatta gactctcgtt 5580 

tggattgcaa ctggtctatt ttcctctttt gtttgataga aaatcataaa aggatttgca 5640 

gactacgggc ctaaagaact aaaaaatcta tctgtttctt ttcattctct gtatttttta 5700 

tagtttctgt tgcatgggca taaagttgcc tttttaatca caattcagaa aatatcataa 5760 

tatctcattt cactaaataa tagtgaacgg caggtatatg tgatgggtta aaaaggatcg 5820 

gcggccgctc gatttaaatc tcgagaggcc tgacgtcggg 5860 

<210> 79 
<211> 8787 
<212> DNA 
<213> Plasmid 
<400> 79 

tcgagggaag gtgaatcgaa tttcggggct ttaaagcaaa aatgaacagc ttggtctata 60 

gtggctaggt accctttttg ttttggacac atgtagggtg gccgaaacaa agtaatagga 120 

caacaacgct cgaccgcgat tatttttgga gaatcgtgcg ttctccccgg gacgtcccac 180 

gacgggcggc accgggcaga ggcaaagccg acagccgtcg catcctaggg agccctttca 240 

tggcctcgtc gccatccacc ccgcccgccg acacccgcac ccgcgtgtcc gccctccgag 300 

aggccctcgc cacccgcgtg gtggtcgccg acggcgccat gggcaccatg ctccaggccc 360 

agaaccccac gctggacgac ttccagcagc tcgaagggtg caacgaggtc ctgaacctca 420 

cccggcccga catcgtccgc tcggtgcacg aggagtactt cgcggccggc gtcgactgcg 480 

tcgagaccaa caccttcggc gccaaccact ccgccctggg cgagtacgac atccccgagc 540 

gcgtccacga actgtccgag gccggcgccc gcgtcgcccg cgaggtcgcc gacgagttcg 600 

gcgcccgcga cggccggcag cgctgggtgc tgggctccat gggccccggc accaagctcc 660 

ccaccctcgg ccacgccccg tacaccgtcc tgcgcgacgc ctaccagcgc aacgccgagg 720 

gactggtcgc gggcggcgcg gacgcactgc tggtggagac cacgcaggac ctgctccaga 780 

ccaaggcctc ggtgctcggc gcccggcgcg ccctggacgt cctcggcctc gacctgccgc 840 

tcatcgtgtc cgtcaccgtc gagaccaccg gcaccatgct gctcggctcg gagatcggcg 900 

ccgcgctcac cgcgctggaa ccgctcggca tcgacatgat cggcctgaac tgcgccaccg 960 

gccccgccga gatgagcgag cacctgcgct acctcgcccg gcactcccgc atcccgctga 1020 

cctgcatgcc caacgccggt ctgcccgtcc tcggcaagga cggcgcccac tacccgctga 1080 

ccgcgcccga gctggccgac gcacacgaga ccttcgtgcg cgagtacggc ctgtccctgg 1140 
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tcggcggctg ctgcggcacc acgcccgagc acctgcgcca ggtcgtcgag cgggtccggg 1200 

acaccgcccc caccgcacgc gacccgcgcc ccgagcccgg cgccgcctcg ctctaccaga 1260 

ccgtgccctt ccgccaggac acctcctacc tggccatcgg cgagcgcacc aacgccaacg 1320 

ggtccaagaa gttccgcgag gccatgctgg acggccgctg ggacgactgc gtcgagatgg 1380 

cccgcgacca gatccgcgaa ggcgcgcaca tgctcgacct ctgcgtcgac tacgtcggcc 1440 

gggacggcgt cgccgacatg gaggaactgg ccggccggtt cgccaccgcc tccacgctgc 1500 

cgatcgtcct cgactccacc gaggtcgacg tcatccgggc cggcctggag aagctcggcg 1560 

gccgcgcggt gatcaactcg gtcaactacg aggacggcgc cggccccgag tcccggttcg 1620 

cccgcgtcac gaagctcgcc cgggagcacg gcgccgcgct gatcgcgctg accatcgacg 1680 

aggtgggaca ggcccgcacc gccgagaaga aggtcgagat cgccgaacgg ctcatcgacg 1740 

acctcaccgg caactggggc atccacgagt ccgacatcct cgtcgactgc ctgaccttca 1800 

ccatctgcac cggccaggag gagtcccgca_aggacggcct ggccaccatc gagggcatcc 1860 

gggaactcaa gcggcgccac ccggacgtgc agaccacgct cggcctgtcg aacatctcct 1920 

tcggcctcaa cccggccgcc cgcatcctgc tcaactccgt cttcctcgac gaatgcgtca 1980 

aggccggcct ggactcggcc atcgtgcacg cgagcaagat cctgccgatc gcccgcttcg 2040 

acgaggagca ggtcaccacc gccctcgact tgatctacga ccgccgccgc gagggctacg 2100 

accccctgca aaagctcatg cagctcttcg agggcgccac cgccaagtcg ctgaaggcct 2160 

ccaaggccga ggaactggcc gccctcccgc tggaggagcg cctcaagcgc cgcatcatcg 2220 

acggcgagaa gaacggcctc gaacaggacc tcgacgaggc cctccgggag cgcccggccc 2280 

tcgagatcgt caacgacacc ctgctcgacg gtatgaaggt cgtcggcgag ctgttcggct 2340 

ccggccagat gcagctgccg ttcgtgctcc agtccgccga ggtcatgaag accgcggtgg 2400 

cccacctgga gccgcacatg gagaagaccg acgacgacgg caagggcacg atcgtgctgg 2460 

ccaccgtccg cggcgacgtc cacgacatcg gcaagaacct cgtcgacatc atcctgtcca 2520 

acaacggcta caacgtcgtc aacctcggca tcaagcagcc cgtctccgcg atcctggaag 2580 

cggccgacga gcaccgggcc gacgtcatcg gcatgtccgg cctcctcgtc aagtccacgg 2640 

tgatcatgaa ggagaacctg gaggagctga accagcgcaa gctggccgcc gactacccgg 2700 

tcatcctcgg cggcgccgcc ctcaccaggg cctacgtcga acaggacctg cacgagatct 2760 

acgacggcga ggtccgctac gcccgcgacg ccttcgaggg cctgcgcctc atggacgccc 2820 

tcatcggcat caagcgcggc gtgcccggcg ccaagctgcc ggagctgaag cagcgccggg 2880 

tgcgggccgc caccgtcgag atcgacgagc gccccgagga aggccacgtc cgctccgacg 2940 

tcgccaccga caacccggtc ccgaccccgc ccttccgcgg cacccgcgtc gtcaagggca 3000 
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tccagctcaa ggagtacgcc tcctggctcg acgagggcgc cctcttcaag ggccagtggg 3060 

gcc.tcaagca ggcccgcacc ggcgagggac cctcctacga ggaactggtc gagtccgagg 3120 

gccggccgcg gctgcgcggc ctgctcgacc ggctccagac ggacaacctt tt^gaggcgg 3180 

ccgtggtcta cggctacttc ccctgcgtct ccaaggacga cgacctgatc gtcctcgacg 3240 

acgacggcaa cgaacgcacc cgcttcacct tcccccgcca gcgccgcggq* cggcgcctgt 3300 

gcctggccga cttcttccgc ccggaggagt ccggcgagac cgacgtggtc ggcttccagg 3360 

tcgtcaccgt cggctcccgc atcggcgagg agacggcccg catgttcgag gccaacgcct 3420 

accgcgacta tctcgagctg cacggcctgt ccgtgcagct cgccgaggcc ctcgccgagt 3480 

actggcacgc gcgcgtgcgc tcggaactcg gcttcgccgg ggaggacccg gccgagatgg 3540 

aggacatgtt cgccctgaag taccggggtg cccgcttctc cctcggctac ggcgcctgcc 3600 

ccgacctgga ggaccgcgcc aagatcgccg ccctgctgga gcccgagcgc atcggcgtcc 3660 

acctatccga ggagttccag ctccaccccg agcagtccac cgacgccatc gtcatccacc 3720 

acccggaggc caagtacttc aacgcccgct gagggatatc gtcgacatcg atgctcttct 3780 

gcgttaatta acaattggga tcctctagac ccgggattta aatcgctagc gggctgctaa 3840 

aggaagcgga acacgtagaa agccagtccg cagaaacggt gctgaccccg gatgaatgtc 3900 

agctactggg ctatctggac aagggaaaac gcaagcgcaa agagaaagca ggtagcttgc 3960 

agtgggctta catggcgata gctagactgg gcggttttat ggacagcaag cgaaccggaa 4020 

ttgccagctg gggcgccctc tggtaaggtt gggaagccct gcaaagtaaa ctggatggct 4080 

ttcttgccgc caaggatctg atggcgcagg ggatcaagat ctgatcaaga gacaggatga 4140 

ggatcgtttc gcatgattga acaagatgga ttgcacgcag gttctccggc cgcttgggtg 4200 

gagaggctat tcggctatga ctgggcacaa cagacaatcg gctgctctga tgccgccgtg 4260 

ttccggctgt cagcgcaggg gcgcccggtt ctttttgtca agaccgacct gtccggtgcc 4320 

ctgaatgaac tgcaggacga ggcagcgcgg ctatcgtggc tggccacgac gggcgttcct 4360 

tgcgcagctg tgctcgacgt tgtcactgaa gcgggaaggg actggctgct attgggcgaa 4440 

gtgccggggc aggatctcct gtcatctcac cttgctcctg ccgagaaagt atccatcatg 4500 

gctgatgcaa tgcggcggct gcatacgctt gatccggcta cctgcccatt cgaccaccaa 4560 

gcgaaacatc gcatcgagcg agcacgtact cggatggaag ccggtcttgt cgatcaggat 4620 

gatctggacg aagagcatca ggggctcgcg ccagccgaac tgttcgccag gctcaaggcg 4680 

cgcatgcccg acggcgagga tctcgtcgtg acccatggcg atgcctgctt gccgaatatc 4740 

atggtggaaa atggccgctt ttctggattc atcgactgtg gccggctggg tgtggcggac 4800 

cgctatcagg acatagcgtt ggctacccgt gatattgctg aagagcttgg cggcgaatgg 4860 
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gctgaccgct tcctcgtgct ttacggtatc gccgctcccg attcgcagcg catcgccttc 4920 

tatcgccttc ttgacgagtt cttctgagcg ggactctggg gttcgaaatg accgaccaag 4980 

cgacgcccaa cctgccatca cgagatttcg attccaccgc cgccttctat gaaaggttgg 5040 

gcttcggaat cgttttccgg gacgccggct ggatgatcct ccagcgcggg gatctcatgc 5100 

tggagttctt cgcccacgct agcggcgcgc cggccggccc ggtgtgaaat accgcacaga 5160 

tgcgtaagga gaaaataccg catcaggcgc tcttccgctt cctcgctcac tgactcgctg 5220 

cgctcggtcg ttcggctgcg gcgagcggta tcagctcact caaaggcggt aatacggtta 5280 

tccacagaat caggggataa cgcaggaaag aacatgtgag caaaaggcca gcaaaaggcc 5340 

aggaaccgta aaaaggccgc gttgctggcg tttttccata ggctccgccc ccctgacgag 5400 

catcacaaaa atcgacgctc aagtcagagg tggcgaaacc cgacaggact ataaagatac 5460 

caggcgtttc cccctggaag ctccctcgtg cgctctcctg ttccgaccct gccgcttacc 5520 

ggatacctgt ccgcctttct cccttcggga agcgtggcgc tttctcatag ctcacgctgt 5580 

aggtatctca gttcggtgta ggtcgttcgc tccaagctgg gctgtgtgca cgaacccccc 5640 

gttcagcccg accgctgcgc cttatccggt aactatcgtc ttgagtccaa cccggtaaga 5700 

cacgacttat cgccactggc agcagccact ggtaacagga ttagcagagc gaggtatgta 5760 

ggcggtgcta cagagttctt gaagtggtgg cctaactacg gctacactag aaggacagta 5820 

tttggtatct gcgctctgct gaagccagtt accttcggaa aaagagttgg tagctcttga 5880 

tccggcaaac aaaccaccgc tggtagcggt ggtttttttg tttgcaagca gcagattacg 5940 

cgcagaaaaa aaggatctca agaagatcct ttgatctttt ctacggggtc tgacgctcag 6000 

tggaacgaaa actcacgtta agggattttg gtcatgagat tatcaaaaag gatcttcacc 6060 

tagatccttt taaaggccgg ccgcggccgc gcaaagtccc gcttcgtgaa aattttcgtg 6120 

ccgcgtgatt ttccgccaaa aactttaacg aacgttcgtt ataatggtgt catgaccttc 6180 

acgacgaagt actaaaattg gcccgaatca tcagctatgg atctctctga tgtcgcgctg 6240 

gagtccgacg cgctcgatgc tgccgtcgat ttaaaaacgg tgatcggatt tttccgagct 6300 

ctcgatacga cggacgcgcc agcatcacga gactgggcca gtgccgcgag cgacctagaa 6360 

actctcgtgg cggatcttga ggagctggct gacgagctgc gtgctcggcc agcgccagga 6420 

ggacgcacag tagtggagga tgcaatcagt tgcgcctact gcggtggcct gattcctccc 6480 

cggcctgacc cgcgaggacg gcgcgcaaaa tattgctcag atgcgtgtcg tgccgcagcc 6540 

agccgcgagc gcgccaacaa acgccacgcc gaggagctgg aggcggctag gtcgcaaatg 6600 

gcgctggaag tgcgtccccc gagcgaaatt ttggccatgg tcgtcacaga gctggaagcg 6660 

gcagcgagaa ttatcgcgat cgtggcggtg cccgcaggca tgacaaacat cgtaaatgcc 6720 
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gcgtttcgtg tgccgtggcc gcccaggacg tgtcagcgcc gccaccacct gcaccgaatc 67B0 

ggcagcagcg tcgcgcgtcg aaaaagcgca caggcggcaa gaagcgataa gctgcacgaa 6640 

tacctgaaaa atgttgaacg ccccgtgagc ggtaactcac agggcgtcgg ctaaccccca 6900 

gtccaaacct gggagaaagc gctcaaaaat gactctagcg gattcacgag acattgacac 6960 

accggcctgg aaattttccg ctgatctgtt cgaicacccat cccgagctcg cgctgcgatc 7020 

acgtggctgg acgagcgaag accgccgcga attcctcgct cacctgggca gagaaaattt 7080 

ccagggcagc aagacccgcg acttcgccag cgcttggatc aaagacccgg acacggagaa 7140 

acacagccga agttataccg agttggttca aaatcgcttg cccggtgcca gtatgttgct 7200 

ctgacgcacg cgcagcacgc agccgtgctt gtcctggaca ttgatgtgcc gagccaccag 7260 

gccggcggga aaatcgagca cgtaaacccc gaggtctacg cgattttgga gcgctgggca 7320 

cgcctggaaa aagcgccagc ttggatcggc gtgaatccac tgagcgggaa atgccagctc 7380 

atctggctca ttgatccggt gtatgccgca gcaggcatga gcagcccgaa tatgcgcctg 7440 

ctggctgcaa cgaccgagga aatgacccgc gttttcggcg ctgaccaggc tttttcacat 7500 

aggctgagcc gtggccactg cactctccga cgatcccagc cgtaccgctg gcatgcccag 7560 

cacaatcgcg tggatcgcct agctgatctt atggaggttg ctcgcatgat ctcaggcaca 7620 

gaaaaaccta aaaaacgcta tgagcaggag ttttctagcg gacgggcacg tatcgaagcg 7680 

gcaagaaaag ccactgcgga agcaaaagca cttgccacgc ttgaagcaag cctgccgagc 7740 

gccgctgaag cgtctggaga gctgatcgac ggcgtccgtg tcctctggac tgctccaggg 7800 

cgtgccgccc gtgatgagac ggcttttcgc cacgctttga ctgtgggata ccagttaaaa 7B60 

gcggctggtg agcgcctaaa agacaccaag ggtcatcgag cctacgagcg tgcctacacc 7920 

gtcgctcagg cggtcggagg aggccgtgag cctgatctgc cgccggactg tgaccgccag 7980 

acggattggc cgcgacgtgt gcgcggctac gtcgctaaag gccagccagt cgtccctgct 8040 

cgtcagacag agacgcagag ccagccgagg cgaaaagctc tggccactat gggaagacgt 8100 

ggcggtaaaa aggccgcaga acgctggaaa gacccaaaca gtgagtacgc ccgagcacag 8160 

cgagaaaaac tagctaagtc cagtcaacga caagctagga aagctaaagg aaatcgcttg 8220 

accattgcag gttggtttat gactgttgag ggagagactg gctcgtggcc gacaatcaat 8280 

gaagctatgt ctgaatttag cgtgtcacgt cagaccgtga atagagcact taaggtctgc 8340 

gggcattgaa cttccacgag gacgccgaaa gcttcccagt aaatgtgcca tctcgtaggc 8400 

agaaaacggt tcccccgtag ggtctctctc ttggcctcct ttctaggtcg ggctgattgc 8460 

tcttgaagct ctctaggggg gctcacacca taggcagata acgttcccca ccggctcgcc 8520 

tcgtaagcgc acaaggactg ctcccaaaga tcttcaaagc cactgccgcg actgccttcg 8580 
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cgaagccttg ccccgcggaa atttcctcca ccgagttcgt gcacacccct atgccaagct 8640 

tctttcaccc taaattcgag agattggatt cttaccgtgg aaattcttcg caaaaatcgt 8700 

cccctgatcg cccttgcgac gttggcgtcg gtgccgctgg ttgcgcttgg cttgaccgac 8760 

ttgatcagcg gccgctcgat ttaaatc Q787 
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